

PENGUIN MODERN LINGUISTICS READINGS

EDITED BY DWIGHT BOLINGER

Penguin Education
Intonation
Edited by Dwight Bolinger


Penguin Modern Linguistics Readings 
General Editor 
David Crystal 


Advisory Board 


Dwight Bolinger 
M. A. K. Halliday 


John Lyons
Frank Palmer
James Sledd
C. I. J. M. Stuart
Intonation 


Selected Readings 
Edited by Dwight Bolinger

Penguin Books

Penguin Books Australia Ltd

Published 1972
Copyright © Dwight Bolinger, 1972

Acknowledgement for items in this volume is on page 456.




To John Derrick McClure 




Contents

Introduction

Part One
Preliminaries

1 Dwight Bolinger (1964)
Around the Edge of Language: Intonation

2 Pierre R. Léon and Philippe Martin (1970)
Machines and Measurements

Part Two
Theory

3 Kenneth L. Pike (1945)
General Characteristics of Intonation

4 George L. Trager (1964)
The Intonation System of American English

5 Robert P. Stockwell (1972)
The Role of Intonation: Reconsiderations and Other Considerations

6 David Crystal (1969)
The Intonation System of English

7 Dwight Bolinger (1970)
Relative Height

Part Three
Intonation and Grammar

8 Pierre Delattre (1966-7)
The Distinctive Function of Intonation

9 Maria Schubiger (1965)
English Intonation and German Modal Particles:
A Comparative Study

[...]
Order of Elements and Sentence Intonation

Philip Lieberman and Sheldon B. Michaels (1962)
Some Aspects of Fundamental Frequency and Envelope Amplitude as
Related to the Emotional Content of Speech

[...]
Speech Melody and Song Melody in Central Thailand

Robert A. Hall, Jr (1953)
Elgar and the Intonation of British English

Ivan Fónagy and Klara Magdics (1963)
[...]

Universality

Raymond S. Larsen and Eunice Victoria Pike (1949)
Huastec Intonation

Kerstin Hadding and Michael Studdert-Kennedy (1964)
An Experimental Study of Some Intonation Contours

Part Seven
Perturbations

22 Ilse Lehiste and Gordon E. Peterson (1961)
Some Basic Considerations in the Analysis of Intonation

23 Werner Meyer-Eppler (1957)
Realization of Prosodic Features in Whispered Speech

24 Nien-Chuang T. Chang (1958)
Tones and Intonation in the Chengtu Dialect (Szechuan, China)

25 Einar Haugen and Martin Joos (1952)
Tone and Intonation in East Norwegian

Part Eight
Varieties of English

26 Ralph Vanderslice and Laura Shun Pierson (1967)
Prosodic Features of Hawaiian English

27 Lorenzo Turner (1949)
Gullah Intonation

Acknowledgements
Author Index
Subject Index




Introduction

There are two kinds of speech sounds, periodic and aperiodic, which is another way of saying musical sounds and noisy sounds. Both are indispensable, but their roles differ. Noisy sounds are mostly limited to the complexes that we call phonemes: the explosion of air when the lips are parted by which a [p] is partially identified, the particular grade of hiss that distinguishes an [f] from an [s]. Musical sounds are used in this way too: particular sets of overtones (called formants) make the difference between one vowel and another, [i] and [e] for example – the higher position of the tongue with an [i] creates resonances in the mouth that are not the same as with the lower position of the tongue for an [e] or an [a]; the positions do the same for the pulsations of the vocal cords that the stops do for the pulsations of an organ pipe – they change the quality. In addition, the mere presence of a musical sound distinguishes a [z] from an [s] – we call this ‘voicing’: a [z] is both voiced and noisy; an [s] is only noisy.

We are not accustomed to thinking of these uses of musical sound as really musical. They are too unlike our conventional notions of music. But language does make use of another part of the sound wave so like that of ordinary music that it is sometimes called the melody of speech. When we sing, most of our musical message is carried not by the overtones (which are what we use to tell one vowel from another) but by the fundamental. This is the tone that is identified musically as an A or a C or an F in such-and-such an octave. When we speak we use the fundamental too, and that is called intonation. It resembles music not only in its physical basis but in other ways as well – both have ties with emotion. The chief difference is that music is an art form and is highly elaborated: we insist on exact intervals and exact combinations, and we play a [...] of imitative and imaginative tricks with melodies and rhythms. Language cannot afford that degree of originality, for it has to be conventional. It has more important business than transmitting feelings, and this [...]

[...] on a question really reflects the [...] excitement or interest in getting an answer; but questions are a grammatical category, and high or rising pitch is one way of telling them from statements.
What is a typical use of intonation in a language like English?

When a layman speaks of intonation he usually means one of two things: the total quality of the sound by which he can distinguish one [...] own sing-song that we are apt to be conscious of it). The second – the tone of voice – comes closer to being purely a matter of fundamental pitch; consider the different sensations one feels on hearing the ‘same’ sentence spoken in these three different ways:

[Pitch diagrams: the sentence Don't be angry spoken with three different intonation contours]

The first is soothing or pleading; the second is assertive – it imposes the speaker's will, and is the way commands are usually made; the third is likely to be explanatory – it could be in answer to ‘How can I keep [...]

[...] in look – the speaker may be expressing something like ‘I'm willing to be considerate about this but don't push me too far.’ And the mandatory assertive intonation may be sweetened by a smile. Gesture has the final [...]

[...] can easily invent examples in which this happens and there is almost no soothing effect at all:

[Pitch diagram: a sentence in which the pitch goes down on was a and up on mon-]

The pitch goes down on was a and up on mon-. The difference is that in Don't be angry the low pitch occurs on the most important word in the [...] The low pitch in the middle suggests that the speaker is ‘holding down’ something, and it is obviously not the same to hold down something trivial as to hold down something important. Intonation, like everything else in language, is one instrument in an orchestra.


[...] three examples. Many languages are poorer in that respect but richer in others. The sharpest divergence is between ‘tone languages’ and ‘intonation languages’. The former use the fundamental as part of the system of distinctive sounds; a word in Chinese or Mazatec or Yoruba (to name three languages in different parts of the world) may differ from another word only in the fundamental pitch with which it is spoken. An ‘intonation language’ lacks this use of pitch but the term is really a misnomer because it has yet to be proved that any language is totally without the expressive uses of pitch that we normally associate with intonation. Pitch is like any other clearly audible characteristic of speech sound: it is there to be used, and almost certainly will be used, though in varying proportions for different purposes. Even in English it is possible for a change in fundamental pitch to make the difference between one phoneme – and hence one word – and another. A vowel following a voiced consonant tends to have lower pitch than one following a voiceless consonant; if the only difference between two consonants is that one is voiced and the other not, then we may hear the effect as much from the pitch of the following vowel as from the voicing itself – under some conditions, hearing Zeke and seek, the fact that the vowel in seek has a higher pitch may be a stronger clue than the voicelessness of the [s]. This use of tone has not become systematic in English as in the so-called tone languages, but it is available if needed.
The fact that English lacks ‘tonemes’ like those of Chinese does not mean that pitch in English is not layered, that is, that it cannot do more than one thing at a time. By merely changing the place where a pitch event occurs, without changing the event itself, we may make a distinction in word meanings at the same time that we produce a particular intonation contour with its own independent meaning. Take the two sentences I don't want to run around and I don't want a runaround. The same intonation – a fairly level pitch followed by an abrupt rise followed by a steep fall – may be used for both, with a meaning something like ‘I assert this’. At normal speed they sound exactly alike except for the place where the rise and the fall occur – and the difference that this makes is not in the intonation (as one would find between ‘I assert this’ and ‘I ask this’) but in the words run around, where the rise occurs on -round, and runaround, where it occurs on run. The traditional view is that the ‘stress’ falls on a different syllable; but our clue to the position of the stress in this case is the behavior of the fundamental pitch.

When it is not distinguishing word meanings or phrase meanings, a change in the location of the event may – still without changing the basic intonation – single out a word for special treatment. The two expressions help yourself and serve yourself are synonymous in referring to food, yet they are handled differently:


1. It's time to eat! Don't wait! Hurry up and serve yourself! 
2. It's time to eat! Don't wait! Hurry up and help yourself! 


In the second of these we have to put the jump in pitch on -self: that is what distinguishes help oneself in this sense from help oneself in ‘Why does he do those things?’ – ‘He just can't help himself.’ But in the first, if the jump is put on -self the meaning is ‘Serve yourself, not somebody else’ – and the yourself becomes contrastive.

And when it is not doing either of these two things, the place where the event occurs may make an affective difference. This can happen when it does not matter which syllable of a word carries the stress and the speaker can put it where he pleases. The following was heard on a television commercial: Do something about that mústache; I don't know why, but I can't get used to a bald-headed man with a mustáche.

So a language which uses pitch in this way – an ‘accent language’ if we need a name for it – is not so very different from a tone language. Either is capable of adding an intonation system on top of its accent or its tone.
There is wide agreement among linguists on the units of sound that make distinctions in word meanings. There is no such agreement on the units of intonation. Some have argued that an intonation contour consists of a succession of levels, others that it is a succession of changes in direction. The disagreement reflects the difficulty of treating intonation independently of all the other events that tend to colour it. A classic example of complexity is the argument over ‘question intonations’. We recognize questions as grammatical entities by such characteristics as inversions (He is here; Is he here?) and interrogative words (He went there; He went where? Where did he go?). Is the intonation of a question to be counted as part of its grammatical identity? If it is, then we may have difficulty deciding what to do with a flip answer like:

[Pitch diagram: an answer beginning Because I want, spoken with rising pitch]

given to the question Why did you do it? It is an answer, hence it is not a question; yet the rising pitch seems to ask ‘And what business is it of yours?’ or ‘What are you going to do about it?’ We may have the same trouble with certain speakers, more numerous in some dialect areas than others, who in giving a long discourse raise their voices every so often, forcing their listeners to give some sign that they are paying attention – their sentences may be statements, but their intonation says ‘Are you listening?’ However important intonation may be to what a grammar classifies as questions, it seems to lead an existence of its own. The disagreements are the result of our knowing so little about that existence and what it means to language as a whole.


[...] questions like these was the purpose of bringing together the articles that follow. Little is settled but much is illuminated. The editor's main [...]


Part One
Preliminaries

[...] about it, requires a point of view. The one presented in the first article is that languages like English have an accentual system, which is more or less independent of the uses of pitch to signal attitudes and divisions of the sentence, and that the latter have the form of configurations (rises, falls and sustains in various combinations) rather than of numbered phonemic levels. There are other viewpoints, as later chapters will show, but the advisability of separating accents (sometimes called stresses – but this term covers more than pitch contrasts) from the rest of intonational phenomena is pretty generally recognized. In any case it is useful to start with because it makes clear that there are layers [...]

[...] nineteenth century, estimates of the pitch curve of the voice depended on the human ear – even today many intonation studies demand no other instrumentation. A steady fundamental pitch can of course be determined by matching it with the known pitch of an instrument, but the normal fluctuating movements need a visual display if they are to be measured and analysed. In the second article Léon and Martin describe the kymograph, a primitive device for producing such a display, along with later instruments, capping their study with a description of their own highly sophisticated and marvellously graphic Melodic Analyser. It is too new to have been used in the experimental studies reported in this volume, but will profoundly affect future work by making it possible to examine large amounts of material without the need to make tedious calculations.
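
As a rough illustration of the kind of calculation that such instruments spare the investigator, the following Python sketch estimates the fundamental of a synthetic voiced sound by autocorrelation. It is not a description of the Melodic Analyser or of any instrument discussed in this volume; the function names, the 8000 Hz sampling rate, the search range and the overtone amplitudes are all assumptions chosen for the example.

```python
import math


def synth_voiced(f0_hz, sample_rate=8000, n_samples=800,
                 harmonics=(1.0, 0.6, 0.3)):
    """Build a toy 'voiced' signal: a fundamental plus two overtones.

    The amplitudes in `harmonics` are arbitrary illustrative values.
    """
    return [
        sum(amp * math.sin(2 * math.pi * f0_hz * (k + 1) * n / sample_rate)
            for k, amp in enumerate(harmonics))
        for n in range(n_samples)
    ]


def estimate_f0(signal, sample_rate=8000, fmin=60.0, fmax=400.0):
    """Estimate the fundamental as the lag with the largest autocorrelation.

    A periodic signal resembles itself most closely when shifted by one
    full period, so the best-matching lag gives the period in samples.
    """
    min_lag = int(sample_rate / fmax)
    max_lag = int(sample_rate / fmin)
    best_lag, best_score = min_lag, float("-inf")
    for lag in range(min_lag, max_lag + 1):
        score = sum(signal[i] * signal[i + lag]
                    for i in range(len(signal) - lag))
        if score > best_score:
            best_lag, best_score = lag, score
    return sample_rate / best_lag


# Two steady pitches an octave apart, measured without a reference instrument.
low = estimate_f0(synth_voiced(100.0))
high = estimate_f0(synth_voiced(200.0))
```

Applied to successive short stretches of a recording, an estimate of this kind yields a pitch curve; real speech would need voicing detection and smoothing, which this sketch leaves out.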




1 Dwight Bolinger 


Around the Edge of Language: Intonation 


Dwight Bolinger, ‘Around the edge of language: intonation’, Harvard Educational Review, vol. 34, no. 2, Spring 1964, pp. 282-93. Copyright © 1964 by President and Fellows of Harvard College.

The surface of the ocean responds to the forces that act upon it in movements resembling the ups and downs of the human voice. If our vision could take it all in at once, we would discern several types of motion, involving a greater and greater expanse of sea and volume of water: ripples, waves, swells and tides. It would be more accurate to say ripples on waves on swells on tides, because each larger movement carries the smaller ones on its back.

Suppose our view were limited to a few inches, and our awareness of the movement depended on watching the bobbing of a cork. We would be conscious of the ripples, but the rest might escape us as irregularities: sometimes the cork would execute its bob at a higher point than at other times, but those high and low times themselves would seem to be perturbed in unaccountable ways. Something to aid our limited view – a tracing, to measure the distance between peaks, or a clock, to measure the time as they passed – would help us to separate the overlying and underlying rises and falls. But even with a clear formulation of the four-tiered hierarchy of movement, our understanding would not be satisfied, we would not feel secure with it, until we had related each level to something beyond mere stirrings of seawater: the ripples with local breezes, the waves with gusts of wind, the swell with a distant storm, and the tide with the pull of the moon and the sun.
In speech (and in song – hence the name ‘speech melody’ to enforce the comparison), the ups and downs are those of the fundamental pitch of the voice, produced by the vibration of the vocal cords. Voice, purely as voice, plays many parts in communication. It provides the overtones that are the raw material for vowels; determines the difference between certain consonants and certain others, such as [s] and [z] or [f] and [v]; most importantly, it is what gives speech its power to ride over noise and carry long distances. Besides these roles – which, though they involve voice and hence tone, could almost as well be monotone – the fundamental pitch of the voice plays others that overlap in their physical manifestations like the motion of the sea. It has taken us a long time to separate the little ups and downs from the big ones; to tell where one stops and another begins; to identify other phonetic events, such as duration and loudness, that are associated with them; and to relate each to some separate function in communication. The work is far from finished, but enough is known so that no textbook on language can claim to be up to date if it fails at least to call attention to intonation as something whose differing forms from language to language have to be taught.
Yet intonation is not as ‘central’ to communication as some of the other traits of language. If it were, we could not understand someone who speaks in a monotone; and, in so far as our comprehension of written language is due to its being a faithful reproduction of speech, we could not read. We therefore must be wary of giving it undue attention just because it is something new.

How important is it? The answer depends on knowing how extensive the differences are between one language or dialect and another, and on knowing where the cost of misunderstanding comes too high. The place to begin is English.

I return to my analogy. The ripples are the accidental changes in pitch, the irrelevant quavers. The waves are the peaks and valleys that we call accent. The swells are the separations of our discourse into its larger segments. The tides are the tides of emotion.
The extremes – ripples and tides – are the easiest to describe and the least significant. The ripples are irrelevant by definition. If the first sound of an utterance is a stop consonant, say [d] as in the word do, the pressure of air that we build up behind it may be such as to heighten the pitch of the first part of the vowel at the moment of its release, which then drops slightly. Similarly, even when we aim at a monotone, the first part of an utterance, drawing upon the bellows-like pressure from the lungs when it is strongest, is higher than what follows, unless we adjust other factors to compensate for it. These are involuntary changes in pitch, and there are many others. Indeed, since emotion affects us in so many ways, we can detect symptoms of it even here, as in the tremolo that goes with restrained tears or anger. We can even feign these symptoms, and it might seem that this makes them part of the communicative system. But somehow we discount those particular fakeries as insincere. The emotion that we deliberately put in, or that may be quite involuntary, and is yet respected as a genuine part of our message, takes a different form: an expansion and contraction of the total range of pitch. A surprised oh, an enthusiastic yes, or an indignant no reaches a pitch well above the average and may sweep to a pitch well below it; if the speaker is bored or indifferent or depressed, his range will shrink. Whether in response to a real stimulus or a pretended one – and here we dissemble outrageously and systematically – the effect is the same.

These facts are so obvious that I mention them only to dismiss them. The ripples and tides are probably much the same in all languages. We do not need to learn them in a foreign language (though, if we are learning the total culture, we might need to learn when to repress them – cultures differ in their concepts of decorum, in what outward manifestations of emotion are condoned). Our real troubles lie in the waves and the swells.

The waves, as the most abrupt intended movements on the bosom of pitch, depend on the relative gradualness of the rest. Ideally, by contrast, the rest is simply a level, inclined or flat, which we may think of as a reference line. Suppose we want to say His brother was the one who cheated him, and to emphasize just the word brother. This is how it comes out:

    bro
His    ther was the one who cheated him.


Someone hearing this might respond with the question

His    ther was the one who cheated him?
    bro

The two sentences are virtually mirror images. In both, the syllable bro- juts out from the reference line, in the one by jumping up, in the other by jumping down. Here there happens to be a return to the same reference line; but sometimes the jump is from one reference line to another:

           brother who cheated him?
It was his

What makes the syllable bro- prominent is its salience in pitch, to which is usually added a little extra duration and also, a good part of the time, some extra loudness. Languages that behave this way have an accentual system. They signal the importance of a word by accenting – giving pitch prominence to – one of its syllables. Picking just one syllable is for the sake of economy. The other syllables are needed for the swells, the larger movements of the reference lines, as we shall see later. For example, in a one-word sentence like the following, the single syllable is prominent, but the leftovers can tell us whether the sentence is a statement or a question:

[Pitch diagrams: the one-word sentence Indiscretions spoken as a statement and as a question, with -cre- accented in both]

Many linguists use the term stress for what I have called accent, or employ the terms interchangeably. I find it more useful to distinguish them, and accordingly I reserve accent for the syllable which actually is highlighted in a sentence – to show the importance of its word – and apply [...] is important enough to get one. In the word fanfare, the stressed syllable is the first; in festoon, the second. While there are certain rough tendencies, such as favoring an initial stress in nouns and adjectives and an end stress in verbs, stress in English can go anywhere.

There is an element of predictability, however, which is worth our attention because of its importance to rhythm as well as accent. What we can be sure of is that the stressed syllable, that is, the accentable one, will not be a syllable that contains a reduced vowel. In fanfare and festoon all vowels are full; except for arbitrary custom, -fare and fes- could be the stressed syllables. In fancy and fatality the syllables -cy and fa- contain reduced vowels, characterized by their loss of duration and their uncertain [...] So it happens that a good part of the time the stressed syllable is unmistakable whether we accent it or not – in formidableness, for instance, the first is stressed, because all the rest are reduced. We can schematize this system in three levels:


Accented syllable vs. unaccented syllable
    (Any stressed syllable can be accented; which ones are depends on the intent of the speaker.)

Stressed syllable vs. unstressed syllable
    (All long syllables can be stressed. Only one, as a rule, actually is – this is an arbitrary trait of the language.)

Long syllable vs. short syllable
    (Long syllables contain full vowels; short syllables contain reduced ones.)

The first and last differences are audible; the middle one is not. Thus in He's a shoe-box manufacturer, said

       shoe-
He's a      box manufacturer.

the syllable shoe stands out because it is accented, and box, man- and -fac- are distinguished by their length and the fulness of their vowels from -u-, -tur- and -er. But without an accent, -fac-, even though it is the stressed syllable of manufacturer, does not stand out from box or man-.
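
The three levels can be put in the form of a small Python sketch. This is only one possible encoding, offered on the assumption that each level presupposes the one below it (accented implies stressed, stressed implies long); the class, field and function names are invented for the illustration, and box is treated here as the unstressed long syllable of the compound shoe-box, which is itself an assumption.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Syllable:
    text: str
    full_vowel: bool        # long syllable (full vowel) vs. short (reduced vowel)
    stressed: bool = False  # the accentable syllable of its word
    accented: bool = False  # actually given pitch prominence in this utterance

    def __post_init__(self):
        # A stressed syllable never contains a reduced vowel.
        if self.stressed and not self.full_vowel:
            raise ValueError("a stressed syllable must have a full vowel")
        # Only a stressed syllable can receive the accent.
        if self.accented and not self.stressed:
            raise ValueError("only a stressed syllable can be accented")


def audible_class(syllable):
    """Return what a hearer can tell apart: accented, long or short.

    Stressed-but-unaccented syllables fall in with the other long ones,
    since the middle distinction is not audible.
    """
    if syllable.accented:
        return "accented"
    return "long" if syllable.full_vowel else "short"


# He's a shoe-box manufacturer, with the accent on shoe.
PHRASE = [
    Syllable("shoe", full_vowel=True, stressed=True, accented=True),
    Syllable("box", full_vowel=True),   # assumed unstressed within the compound
    Syllable("man", full_vowel=True),
    Syllable("u", full_vowel=False),
    Syllable("fac", full_vowel=True, stressed=True),
    Syllable("tur", full_vowel=False),
    Syllable("er", full_vowel=False),
]

CLASSES = {s.text: audible_class(s) for s in PHRASE}
```

In this encoding -fac- comes out in the same audible class as box and man-, which is the point of the example: stress by itself adds nothing that can be heard.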

I must warn the reader that this is not the analysis of English stress he is apt to find in textbooks. I offer it because I believe it is more accurate and because it more sharply distinguishes the role of pitch, limiting it to the topmost level, that of accent. And it has the further virtue of focusing on syllable types rather than talking about ‘weak stress’. This is important in learning a language that either lacks the long-short dichotomy or has it only sparsely represented. Making students pace out a sentence in French or Spanish according to a fairly regular beat, instead of turning it into a fox-trot, is one way of mastering it. The smooth rhythm of successive long syllables (Which fandango came first?) is the exception in English, but is often the rule elsewhere.

Accentual systems involve more than singling out important words by accenting them. Accents and particular positions of accents become characteristic of sentences. When this happens, an adjustment may be [...] spot. We tend to favor the two extremes of the sentence (or, in [...] sentences, the two extremes of each relatively independent phrase or clause), as if to announce the beginning and the end. There may be intermediate accents, but they are less prominent. This gives the sentence the shape of a bumpy suspension bridge:

[Pitch diagram: The snow generally comes early in October, with the highest peaks on snow and -to- and a lower rise on ear-]

Here the first accent is on snow and the last is on -to-, which stand as the pillars of the bridge. The roadbed is heaved up somewhat on the syllable ear-, and would be heaved up equally on other intermediate accents,

[Pitch diagram: The snow generally comes early in the month of October, with intermediate rises on ear- and month]

– but not ordinarily to the height of the terminal pillars.

It happens that in this example there is no conflict between the sentence accents and the accents of importance, and no adjustment is necessary. Or should we rather say that the language, by putting important things at opposite extremes, has already made the adjustment for us?

But if we rearranged things a bit, we might get

[Pitch diagram: The snow comes early in October, generally, with peaks on snow and -to- and generally trailing off at low pitch]


[...] importance outweighs the tendency to put an accent at the end, and generally simply trails off. (This is not to rule out the possibility of its getting its own accent if it is an important afterthought:

[Pitch diagram: The snow comes early in October, generally, with a separate accent on generally])

Nevertheless, though an accent of importance will always take command if there is a conflict, the tendency to put an accent at the end is often powerful enough to shift the stress of a word in order to have its way. I have recorded examples like the following from speakers in all walks of life:

[Pitch diagram: a sentence ending in to be influenced, with the final accent on -enced]

The normal ínfluenced becomes influénced. At the front end of the sentence the opposite happens, and one gets the strange pairs

[Pitch diagrams: a pair of sentences containing absolutely, with the accent on different syllables of the word]

A fair number of words (absolutely, [...] cannot, nearby, almost, and others) permit this chameleonic shift of stress. It makes no difference to the importance of the word, of course, which syllable gets the accent, so long as one of them does.

And there is a special form of adjustment in which two syllables are accented. In answer to What was that king's name? the reply might come

[Pitch diagram: It's Nebuchadnezzar, with two accents within the one word]

with the two accents of the sentence reduced to two accents within a single word.¹

I have been speaking all along as if accent and emotion were separable, and have gone so far as to push emotion beyond the horizon and called it a tide, as distinct from the wave of accent. But jumps in speech are never far from jumpiness. An accent to show the importance of a word inescapably shows its importance for us; it is as if we meant to say ‘This excites me’, and left our hearer to infer ‘It's worth getting excited about.’ So we need not be surprised to find two significant emotional overtones in accent.

1. This illustrates the ‘secondary stresses’ which are, actually, the result of just such an adjustment: when we pronounce a word in isolation – say it as a ‘citation form’ – we make a sentence of it, and the secondary is a result.



One appears in the difference between an accent that jumps up from the 
reference line, and one that jumps down. The upward jump is unrestrained. 
The downward jump is the opposite. The restraint of the downward jump 
lends itself to many shadings — to express comfort, reassurance, doubt. In 
the following example the robust approval of the upward jump contrasts 
with the reservation of the downward jump: to the question What do you 
think of it? comes the reply 


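[Pitch diagrams: It's nice, spoken once with an upward jump on nice and once with a downward jump]

The other emotional overtone is simply the use of accents – often repeated accents in a single word (even a single syllable) – to express great emphasis:

[Pitch diagram: a sentence containing absolutely positive, with repeated accents]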

The waves of accent need only be large enough to set off the accented syllable from its surroundings – this is why the effect could so easily be confused with loudness: the change in pitch was not radical enough for the hearer to sense its direction. The swells of separation are different. Their movement is necessarily wide. Indeed, the greater prominence of accents at the extremes of the sentence that we have just noted might better be put down as a manifestation of separation: the wave is there, but augmented by the edge of a swell that coincides with the sentence itself, to separate it from other sentences. ‘Here’, it seems to say, ‘is where the sentence begins; and here is where it ends.’

Because interruptions are always jarring, the most conspicuous instance of separation is parenthesis. Once again the partners of pitch are duration – now in the form of a pause or of the kind of lengthening that substitutes for a pause – and loudness. The pitch level of the entire parenthesis is lowered, the volume is reduced, and the extremes are set off by pauses, though the advantage of relying more on pitch is that one needs to rely less on pause, and speech is accordingly not slowed down unnecessarily. In the following example, there is not only a lowering of the parenthesis itself, but a certain amount of raising in the environment of the parenthesis, to make the contrast all the clearer:

[Pitch diagram: Besides timber, its main sources appear to be chicle, a substance used largely in chewing gum, and oil, with the parenthesis a substance used largely in chewing gum at a lowered pitch and the surrounding words raised]



If it were not for the intervening parenthesis, the last two words, and oil,
would have been at a considerably lower pitch (lower them here, and you
[…]). […] main utterance overlap with those of the parenthesis – the range at our
disposal is not wide enough to admit of a complete wrenching apart –
but there is no mistaking the contrast in levels. The accents continue to
appear where they normally would, but in the parenthesis they are flattened
somewhat. Though it has nothing to do with the parenthesis, we can also
see in this example the use of relative height for relative importance
among the accents themselves, a characteristic of English that is still to be
explored: the -sides of besides clearly carries an accent, but that of tim-
goes higher.
More typical and far more frequent than separations for parenthesis,
where the lowered pitch suggests a lower-ranking element of the discourse,
are separations that divide equal-ranking elements from one another.
In their grossest form these are of two kinds: sweeping rises and sweeping
falls. In

[Pitch diagram: ‘He took out his trusty knife and slashed away’, with a sweeping rise on knife and a steep fall on -way.]

the sweeping rise on knife separates the two clauses. The steep fall on -way
separates the sentence from what follows, but there is not really the sharp […]

[Pitch diagram: a second rendering of ‘He took out his trusty knife and slashed away’.]


As with accents, a greater or lesser breadth of movement may establish
a hierarchy of importance. In

[Two pitch diagrams of ‘If I'm still around when you hear from him let me know’, differing in the extent of the rises.]

the extent of the rises (with or without similarly differentiated pauses) tells
whether the when clause is to be taken with If I'm still around or with let me
know.


Simple rises for separation are common enough in English – for instance,
they are almost always used with gnomic expressions like Easy come, easy
go, Out of sight, out of mind – but they are apt to sound somewhat flip in
ordinary discourse:

[Pitch diagram: ‘If he doesn't like it he knows what he can do’, with a simple rise at the separation.]


Simple rises – rises that once started do not drop back – are not only
common, but are the rule, at least in French, Spanish and German, with […]

The typical more nearly neutral shape in American English is different. In
place of merely gliding up, it first goes up, then down, then up again but
not very far – a rise-fall-rise:³

[Pitch diagram: ‘In the winter, the snow generally comes early, in October’.]


The shape is particularly noticeable when unaccented elements are added
after the accent, prolonging the rise-fall-rise into a broad undulation:

[Pitch diagram beginning ‘In the winter…’; the rest of the example is not recoverable.]
3. This is a device that enables English to make a sharper distinction between
questions and non-questions. The simple rise is used for all forms of incompleteness,
including interrogation. The rise-fall-rise is incomplete, but its use in questions is
extremely limited; in fact, it is used far less than is a straight fall:

[Pitch diagram: ‘Was it yesterday that they came?’, ending in a straight fall.]

is a normal informed question;

[Pitch diagram: ‘Was it yesterday that they came?’, with a rise-fall-rise.]

would hardly be used except to repeat the speaker's own question as a sort of
admonition, or to repeat the interlocutor's question in a […]. Where a
separation comes in the middle of a question, the rise-fall-rise is avoided if what
precedes is itself properly a question. Thus in When you don't get exercise, do you have
a good appetite? the rise-fall-rise is normal on exercise; but when the clauses are
reversed, it is not normal on appetite. It is normal between statement alternatives
(I'll either read awhile, or I'll go to bed), but not between question
alternatives (Will you read awhile, or will you go to bed?).


The simple rise does more than mark a separation, of course. Since it is
clearly set apart from the fall that marks the end of a statement, the rise
signals a separation at a point of incompleteness. This leads to incompleteness
at a further remove, though the kinship is still apparent:

[Pitch diagram not recoverable.]


The relationship between the incompleteness of the if clause and that of
the question with do adds a third link between such clauses and questions,
which are already tied together by the possibility of inversion (Were I
you ...) and by the sharing of if (I don't know if ...: indirect question).

Questions are of course the prime examples of unresolved utterances,
whose resolution awaits their answers: there is no great difference,
intonationally, between an unresolved clause and its resolution when in
the mouth of a single speaker, and when in the mouths of two. But it
must not be thought that the rise is a pure grammatical symbol for
interrogation, for questions neither require it nor monopolize it. Other forms
of incompleteness are of everyday occurrence. In the following, the aim is
simply to leave the hearer in suspense, as if to say ‘[…] Imagine the consequences’:

[Pitch diagram; the text of the example is only partly legible.]

In the next example, which is a petulant reply to […], incompleteness
implies ‘What business is […]’:

[Pitch diagram not recoverable.]


And after a low-pitched accent the rise seems to imply merely an absence
of assertiveness, which helps to give this contour its air of complaisance:

[Pitch diagram: ‘Don't worry, I'll help you’.]

The fall is typically terminal, but from literal ‘conclusion’ it has passed
to a figurative ‘conclusiveness’ and may occur anywhere:

[Pitch diagram: ‘You must never, never, say things like that’.]


The deeper the fall, the more conclusive; just as, with the rise, the steeper
it is the more inconclusive it is. As with accents, this proves once more how
difficult it is to separate emotion from other functions of intonation. We
are right when we stigmatize a monotone as ‘lifeless’. Intonation is a
half-tamed servant of language. The rise and fall can be thought of as
grammatical signals of completeness and incompleteness, or as emotional
gauges of tension and relaxation. Adding intonation, we turn each […]
message into an act of will.
If this were a treatise on intonation, and not just a rapid survey […] to be
convincing, we could go on for a hundred pages more to consider the
nuances of accent and separation. We would take into account the patterning
of accents in ordinary commands, each lower than the last, a
mountain range flanked by descending foothills:

[Pitch diagram not recoverable.]

We would compare the similar patterning, but differing pitch level, of
questions introduced by interrogative words, and statements:

[Pitch diagrams: a question (‘But how did you get home?’) and a statement; only partly legible.]

We would examine the ties between accents and grammatical constructions
such as the passive; or between separations and the use of extra words to
make them possible, as in the following, which answers Who would […]

[Pitch diagram not recoverable.]

where if we are positive he is enough, but if we are tentative, and […]
a rise-fall-rise, the extra word is needed to cover the […]
undulation. The ramifications are legion.


2 Pierre R. Léon and Philippe Martin

Machines and Measurements

Pierre R. Léon and Philippe Martin, Prolégomènes à l'étude des
structures intonatives, Marcel Didier, Montréal, Paris, Bruxelles, 1970,
pp. 85-97, 172-80. Translated for this volume by Susan Husserl-Kapit.


One of the first measuring devices in instrumental phonetics was the
kymograph described by Rousselot (1897-1908) in his Principles of
Experimental Phonetics.

The principle of the apparatus (Figure 1) is simple. The sound waves
of the word are transmitted by a rubber tube to a drum, which is caused
to vibrate. A recording stylus mounted on the drum inscribes the vibrations
on a sheet coated with lampblack attached to a cylinder revolving
at constant speed.
The drum, acting as a low-pass filter, fails to pick up the higher harmonics
of the sounds of the word. The resulting curve brings out at most
the first and second harmonics. A curve like that of Figure 2, rich in
harmonics, becomes – when recorded on a machine as rudimentary as the
kymograph – a trace like that of Figure 3.

Figure 2 Curve rich in harmonics (made by oscillograph)

Figure 3 Kymographic curve. The harmonics do not generally appear

A kymographic tracing therefore does not suffice to analyse the distinctive
sounds of the word (its phonemes), but it is good enough to study
the three prosodic parameters, duration, intensity and pitch. Following is
a kymographic tracing of the sentence L'horizon tout entier s'enveloppe
dans l'ombre (Figure 4).


Figure 4 Kymographic recording of the sentence ‘L'horizon tout entier
s'enveloppe dans l'ombre’


Duration. This is measured in hundredths of a second. It can be easily
calculated from a kymographic tracing when the speed of rotation of the
paper is known.
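As a minimal sketch of the calculation just described, the duration of a segment can be taken as its length on the tracing divided by the speed of the paper. The function name, the choice of millimetres, and the sample figures are assumptions made for illustration; they do not come from the text.

```python
def duration_cs(length_mm: float, paper_speed_mm_per_s: float) -> float:
    """Duration of a segment in hundredths of a second.

    length_mm            -- length of the segment measured on the tracing
                            (hypothetical unit: millimetres)
    paper_speed_mm_per_s -- speed at which the paper moves past the stylus
    """
    if paper_speed_mm_per_s <= 0:
        raise ValueError("paper speed must be positive")
    # seconds = length / speed; multiply by 100 for hundredths of a second
    return length_mm * 100.0 / paper_speed_mm_per_s


# Illustrative figures only: a 15 mm segment on paper moving at 100 mm/s
print(duration_cs(15.0, 100.0))
```

The faster the paper moves, the longer a given segment appears on the tracing, and the finer the resolution with which its duration can be read.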


Intensity. The amplitude of the resulting curve can be measured on a
kymographic tracing by using an intensity meter. Given the inertia of the
rubber membrane of the recording drum and the resulting frequency
response, differences in intensity show up rather poorly on a kymogram.
Rousselot calculated the intensity of sounds just by measuring the
amplitudes in millimetres.


Pitch. Pitch can be determined by calculating the frequency of each
phone. To do this it suffices to note how many vibrations there are during
its emission. If the phone is of brief duration, say around fifty milliseconds,
it is enough to allow for a single average frequency. But if the
phone is long (for example a long vowel), it is necessary to average the
frequency every fifty milliseconds so as to catch the possible changes in
pitch, as in the example below (Figure 5).

Interval            50 ms   50 ms   50 ms   50 ms   50 ms   50 ms   50 ms
Double vibrations   6 dv    5 dv    5 dv    5 dv    4 dv    4 dv    3 dv
Frequency           120 Hz  100 Hz  100 Hz  100 Hz  80 Hz   80 Hz   60 Hz

Figure 5 Kymographic display showing vibrations and frequency calculations.
In the first 50 ms sequence 6 double vibrations are visible. The calculation
6 × 1000/50 = 120 gives the pitch of the note, which corresponds to 120 Hz
(or 120 cycles per second)
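The counting procedure of Figure 5 can be sketched in a few lines. The counts of double vibrations are those shown in the figure; the function and variable names are assumptions introduced here for illustration.

```python
def frequency_hz(double_vibrations: int, window_ms: float = 50.0) -> float:
    """Frequency in Hz from the number of complete (double) vibrations
    counted in a window of the given length in milliseconds."""
    if window_ms <= 0:
        raise ValueError("window length must be positive")
    # cycles per second = cycles counted * (1000 ms per second / window in ms)
    return double_vibrations * 1000.0 / window_ms


# Double vibrations counted in seven successive 50 ms windows (Figure 5)
counts = [6, 5, 5, 5, 4, 4, 3]
contour = [frequency_hz(n) for n in counts]
print(contour)
```

Because only whole vibrations are counted, each 50 ms window resolves pitch in steps of 20 Hz, which is one reason the method is coarse for short phones.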


It is especially the pitch of the vowels that counts in the perception of
intonation, and it is often pointless to compute the vibrations of the
consonants, as some researchers have done.

One needs a good magnifying glass, good eyesight, and plenty of
patience to make out the intonation even of a simple phrase by using this
method, which phoneticians regularly employed in the heroic days of
instrumental phonetics.


Kymographic studies
Nevertheless, a good many important studies were carried out with the
kymograph at the turn of the century – and not a few even of fairly recent
date. Among the most important are the ones listed below:


BEACH, D. M. (1938), The Phonetics of the Hottentot Language, Heffer, Cambridge.
CAMA, K. (1939), ‘A study of the native Hindustani melody pattern and the
acquired English melody pattern with special reference to the teaching of English
in India’, Arch. Néerl. de Phon. Expér., vol. 15, pp. 103-10.
CANELLADA, M. J. (1941), ‘Notas de entonación extremeña’, Rev. filol. esp.,
vol. 25, pp. 79-91.
DAAN, J. (1938), ‘Dialect and pitch pattern of the sentence’, Proc. 3rd Int.
Congr. Phon. Sci., Ghent, pp. 473-80.
GILI GAYA, S. (1924), ‘Influencia del acento y de las consonantes en las curvas de
entonación’, Rev. filol. esp., vol. 11, pp. 154-77.
MAACK, A. (1957), ‘Verzerrungsfreie Melodiewinkel aus der Tonhöhenkurve’,
Phonetica, vol. 1, pp. 206-15.
MAGDICS, K. (1959), ‘Intonation of the Hungarian settlers from Bukovina’,
Acta Ling. Hafn., vol. 9, pp. 187-227.
MALHIAC, H. (1953), Analyse et enregistrement de la voix parlée et chantée,
Société Générale d'Impression, Toulouse.
ROUSSELOT, P. (1924), Principes de phonétique expérimentale, 2e éd., tomes 1 et 2,
Didier, Paris.
SCRIPTURE, E. W. (1902), ‘Studies of melody in English speech’, Philosophische
Studien, vol. 19, pp. 599-615.
SÉGUY, J. (1953), ‘Un combiné magnétophone-électrokymographe en vue de
l'analyse tonométrique’, Orbis, vol. 2, pp. 518-20.
SÖDERGARD, O. (1957), ‘L'intonation syntaxique en français’, Stud. Ling.,
Lund, vol. 11, no. 1, pp. 92-120.
WÄNGLER, H.-H. (1963), Zur Tonologie des Hausa, Schriften zur Phonetik,
Sprachwissenschaft und Kommunikationsforschung (6), Akademie-Verlag, Berlin.
ZWIRNER, E., and ZWIRNER, K. (1936), Grundfragen der Phonometrie, Berlin.


Oscilloscope and oscillograph
The oscilloscope makes it possible to represent the sound waves by a curve
produced on the screen of a cathode ray tube, through the movement of a
luminous spot created by the impact of electrons on a fluorescent coating.
Thanks to the low inertia of the moving elements – the electrons – one
can examine the swiftest and most fleeting phenomena on the screen of an
oscilloscope. Some instruments have a passband of 2000 MHz, or two
thousand million cycles per second. The frequencies of vocal sounds are
between 0 and 10,000 Hz. It is therefore possible, even with a very inexpensive
oscilloscope, to make very precise analyses in acoustic phonetics.

To get an oscillogram, one may either film the luminous spot or use a
recording instrument called an oscillograph.


Figure 6, below, gives an idea of the richness of the oscillographic
curve by contrast with the kymographic curve (Figure 4 above). The
oscillographic curve makes it possible, theoretically, to study the timbre
of the phones recorded. Actually, the analysis that has to be carried out is
still too complex and for this kind of study another instrument is used, the
spectrograph (see below).

Figure 6 Oscillographic recording of the sentence ‘L'horizon tout entier
s'enveloppe dans l'ombre’

In short, the use of the oscilloscope in experimental phonetics is the
same as that of the kymograph where the purpose is to analyse the three
parameters that are most important to the prosody: duration, intensity
and pitch. But the oscillograph is superior to the kymograph in precision.
It provides for more paper speeds (Figure 7). Higher speeds make it easier
to count vibrations in order to study frequency. Finally, the scaling of the
paper and the possibility of amplitude settings make the calculation of
amplitudes easier and safer.

Figure 7 Two oscillographic recordings of the same sound with different
speed settings (above, 100 mm/s; below, 1000 mm/s)


Oscillographic studies

The oscillograph is one of the most widely used instruments in modern
phonetics laboratories, even though using it to measure pitch is as tedious
as using the kymograph. It is hard to demarcate units, to pinpoint vibrations,
etc.

Among modern studies whose authors describe their technique of
oscillographic analysis for investigating prosodic phenomena the following
can be mentioned. Of these, that of Burgstahler and Straka (1964)
seems both the most useful and the most clearly explained from a
methodological point of view.
BOUDREAULT, M. (1968), Rythme et mélodie de la phrase parlée en France et au
Québec, Les Presses de l'Université Laval, Québec et Librairie C. Klincksieck,
Paris.
BURGSTAHLER, P., and STRAKA, G. (1964), ‘Étude du rythme à l'aide de
l'oscillographe cathodique combiné avec le sonomètre’, Trav. Ling. Litt.,
pp. 125-41.
DEVA, B. C. (1960), ‘Psychophysics of speech-melody’, Z. Phon., vol. 13, pp. 8-27.
HOLDER, M. (1968), ‘Étude sur l'intonation comparée de la phrase énonciative
en français canadien et en français standard’, in Recherches sur la structure
phonique du français canadien, P. R. Léon (Studia Phonetica I), Didier, Montréal,
Paris et Bruxelles, pp. 175-91.
JASSEM, V. (1959), ‘The phonology of Polish stress’, Word, vol. 15, pp. 252-69.
LEDEBOER VON WESTERHOVEN, L. E. (1938), ‘Melodie und Tonbewegung im
Niederländischen’, Proc. 3rd Int. Congr. Phon. Sci., Ghent, pp. 489-96.
LÉON, P. R., and […], R. A. (1969), ‘Deux interprétations du “Pont M[…]”
[…]’, Phonetica, vol. 19, pp. 82-103.
MALMBERG, B. (1940), ‘Recherches expérimentales sur l'accent musical du mot
en suédois’, Arch. Néerl. de Phon. Expér., vol. 15, pp. 62-76.
PARMENTER, C. E., and TREVIÑO, S. N. (1930), ‘L'intonation italienne’,
Italica, vol. 7, pp. 80-84.
PARMENTER, C. E., and TREVIÑO, S. N. (1932), ‘A technique for the analysis
of pitch in […]’, Arch. Néerl. de Phon. Expér., vol. 7, pp. 1-29.
ROBINSON, L. (1968), ‘Étude du rythme syllabique en français canadien et en
français standard’, in Recherches sur la structure phonique du français canadien,
P. R. Léon (Studia Phonetica I), Didier, Montréal, Paris et Bruxelles, pp. 161-74.
VARDANIAN, R.-M. (1964), ‘Teaching English intonation through oscilloscope
displays’, Lang. Learn., vol. 14, nos. 3-4, pp. 109-17.


The Spectrograph
The spectrograph is a much more versatile instrument for analysis than the
oscillograph. Many technical descriptions of the machine and its use are
to be found. The most important are those of Potter, Kopp and Green
(1947) and Martin Joos (1948). This apparatus, marketed by the Kay
Electric Company, is essentially a spectral analyser into which a recording
(no more than 2.4 seconds long) can be fed to obtain a spectrogram representing
the harmonics of the sound on a frequency scale of 0 to 8000 Hz
(narrow-band setting, Figure 9). Intensity is shown by the greater or lesser
degree of blackness of the harmonics on the spectrogram. The overall
intensity is also shown by a linear or logarithmic amplitude display (see
also the figure below).

Figure 8

[…] their variations in frequency shown in the undulating movements of the
horizontal lines. With a spectrum of this type the changes in intonation can
be observed by following the curve of the fundamental. However, since […]
and errors of measurement are liable to be serious, it is better to take
some higher harmonic as the basis for measurement. In the sentence of
Figure 9, the […] harmonic, easiest to follow with the eye, has the following values:

[vu    ze    me    le    zes   kar   go]
181    187   275   250   219   156   300

Figure 9 Narrow-band spectrogram of the sentence ‘Vous aimez les
escargots?’ Duration is shown in ms on the abscissa, frequency (in this
illustration) from 0 to 5000 on the ordinate. The jagged line at the top
marks changes in intensity in db


By measuring each of the other harmonics one can prove the simple
relationship that exists among them. If the tenth is at 1500 the first (or
fundamental) will be at 150 Hz, the second at 300, the third at 450,
the fourth at 600, the fifth at 750, the sixth at 900, the seventh at 1050, the
eighth at 1200, the ninth at 1350. If for any reason whatever (noise,
filtering, etc.) the fundamental is absent from the spectrum of a vowel,
one can always deduce its pitch from that of any two consecutive harmonics.
By the same token it is advisable to calculate the frequency from
the tenth harmonic, for example: the chances of error in measuring the
fundamental are divided by ten.
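The arithmetic behind these statements can be sketched as follows. The two function names are assumptions introduced for illustration, and the 20 Hz reading error is a hypothetical figure, not one given in the text.

```python
def f0_from_harmonic(harmonic_hz: float, harmonic_number: int) -> float:
    """Fundamental frequency from the measured frequency of the n-th harmonic."""
    if harmonic_number < 1:
        raise ValueError("harmonic number starts at 1 (the fundamental)")
    return harmonic_hz / harmonic_number


def f0_from_consecutive(lower_hz: float, upper_hz: float) -> float:
    """Fundamental frequency as the spacing between two consecutive harmonics."""
    return upper_hz - lower_hz


# The example in the text: tenth harmonic at 1500 Hz
print(f0_from_harmonic(1500.0, 10))

# Fundamental missing from the spectrum: use the third and fourth harmonics
print(f0_from_consecutive(450.0, 600.0))

# A hypothetical reading error of 20 Hz on the tenth harmonic
# shifts the estimated fundamental by only a tenth of that amount
print(f0_from_harmonic(1520.0, 10) - f0_from_harmonic(1500.0, 10))
```

Dividing by the harmonic number divides the reading error as well, which is why the text recommends measuring a high harmonic whenever it is legible.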


There exists another type of spectrogram using a wider frequency scale.
A frequency amplification is selected, say ten, and one obtains a spectrum
ten times as wide for a given range of frequencies (the time scale does not […]

Figure 10 Spectrogram of the same sentence as Figure 9, showing the lower
harmonics spread out

Figure 11 Wide-band spectrogram of the same sentence as Figures 9 and 10.
It is comparatively easy to calculate the number of vibrations (vertical
striations) of the low-pitched sounds but harder to […]


Spectrographic studies

Most modern phonetics laboratories have a spectrograph and the studies
using it are numerous. Improvements in technique have
greatly reduced the time required to make a spectrographic analysis.
Nevertheless the procedure is slow and melodic analysis using the apparatus […]
as is the case with melodic analysers (see the next section). If the spectrograph
continues to be widely used for melodic analysis, it is mainly because
of its reliability. With a little practice it is almost impossible to go wrong
in calculating harmonic frequencies.

The following studies are analyses of prosodic phenomena, especially
intonation; some contain explanations of the techniques used.

BOLINGER, D. L. (1951), ‘Intonation: levels versus configurations’, Word,
vol. 7, pp. 199-210.
CRYSTAL, D., and QUIRK, R. (1964), Systems of Prosodic and Paralinguistic
Features in English, Mouton, The Hague.
DELATTRE, P. (1961), ‘La leçon d'intonation de Simone de Beauvoir, étude
d'intonation déclarative comparée’, French Review, vol. 35, pp. 59-67.
DELATTRE, P. (1963), ‘Comparing the prosodic features of English, German,
Spanish and French’, IRAL, vol. 1, pp. 193-210.
DELATTRE, P. (1966a), ‘Les dix intonations de base du français’, French Review,
vol. 40, no. 1, pp. 1-14.
DELATTRE, P. (1966b), ‘A comparison of syllable length conditioning among
languages’, IRAL, vol. 4, no. 3, pp. 183-98.
DELATTRE, P., POENACK, E., and OLSEN, C. (1965), ‘Some characteristics of
German intonation for the expression of continuation and finality’, Phonetica,
vol. 13, pp. 134-61.
FANT, G. (1961), ‘Sound spectrography’, Proc. 4th Int. Congr. Phon. Sci.,
Helsinki, pp. 14-33.
FAURE, G. (1961), ‘L'intonation et l'identification des mots dans la chaîne
parlée (exemples empruntés à la langue française)’, Proc. 4th Int. Congr. Phon.
Sci., Helsinki, pp. 598-609.
HARDMAN, J. M. (1966), Jaqaru: Outline of the Phonological and Morphological
Structure, Mouton, The Hague, cf. pp. 26-8.
JASSEM, V. (1959), ‘The phonology of Polish stress’, Word, vol. 15, pp. 252-69.
KALLIOINEN, V. (1968), ‘Suomen kysymyslauseen intonaatiosta’ (Remarks on
the intonation of interrogative sentences in Finnish), Virittäjä, vol. 1, pp. 35-54.
LEHISTE, I. (1961), ‘Some acoustic correlates of accent in Serbo-Croatian’,
Phonetica, vol. 7, pp. 114-47.
LÉON, P. R. (1967), ‘La joncture externe en français: nature et fonction’,
Phonologie der Gegenwart, pp. 298-306.
MALMBERG, B. (1961), ‘Analyse instrumentale et structurale des faits d'accents’,
Proc. 4th Int. Congr. Phon. Sci., Helsinki, pp. 456-15.
MUST, H. (1959), ‘Duration of speech sounds in Estonian’, Orbis, vol. 8, pp. 213-23.
REHDER, P. (1968), Beiträge zur Erforschung der serbokroatischen Prosodie,
Verlag Otto Sagner, Munich.
SAPON, S. (1958-9), ‘Étude instrumentale de quelques contours mélodiques
fondamentaux dans les langues romanes’, Rev. filol. esp., vol. 42, pp. […].
SHEN, Y., CHAO, J., and PETERSON, G. (1961), ‘Some spectrographic light on
Mandarin Tone 2 and Tone 3’, Stud. Sounds, vol. 9, pp. 265-314.
SHIMAOKA, T. (1966), ‘A contrastive study on rhythm and intonation of English
and Japanese with spectrographic analysis’, Stud. Sounds, vol. 12, pp. 347-62.
SPEARS, R. A. (1966), ‘A note on the tone of Maninka substantives’, J. Afr.
Lang., vol. 5, no. 2, pp. 113-20.
SZMIDT, Y. (1968), ‘Étude de la phrase interrogative en français canadien et en
français standard’, in Recherches sur la structure phonique du français canadien,
P. R. Léon (Studia Phonetica I), Didier, Montréal, Paris et Bruxelles, pp. 192-209.
WEINREICH, U. (1956), ‘Notes on the Yiddish rise-fall intonation curve’, in
M. Halle et al. (eds.), For Roman Jakobson, Mouton, The Hague, pp. 632-43.
WONG, H. (1953), ‘Outline of the Mandarin phonemic system’, Word, vol. 9,
pp. 268-76.


The melodic analyser of the University of Toronto

In very general terms, the system consists of a series of four Tchebycheff
filters ranging from 70 to 500 Hz, a computer program, and a sub-program
for correcting such things as ‘jitters’ and ‘misses’. Once the speech signal
starts, the computer examines each channel to detect the location of the
fundamental frequency. In other words the computer tries to detect in
which channel the fundamental frequency is being filtered and, by comparing
the values that have gone before and the values that come after the
one that is being examined, it tries to be sure of extracting the fundamental
frequency and not the second harmonic (James, 1970, p. 170).
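The text does not say how the correcting sub-program works, so what follows is only a sketch of the general idea of comparing each value with the values before and after it. It assumes a pitch track given as one frequency per analysis frame, treats an isolated value near twice its neighbours as the second harmonic, and fills an isolated missing value from its neighbours. The function name, the tolerance and the sample track are all hypothetical, and this is not the Toronto analyser's own procedure.

```python
from statistics import median
from typing import List, Optional


def correct_track(track: List[Optional[float]], tol: float = 0.15) -> List[Optional[float]]:
    """Return a copy of a pitch track (Hz per frame, None = no value detected)
    with isolated octave jumps halved and isolated gaps filled."""
    fixed = list(track)
    for i, value in enumerate(track):
        before = track[i - 1] if i > 0 else None
        after = track[i + 1] if i + 1 < len(track) else None
        neighbours = [v for v in (before, after) if v is not None]
        if not neighbours:
            continue  # nothing to compare with
        reference = median(neighbours)
        if value is None:
            # a 'miss': fill it only when both neighbours are known
            if len(neighbours) == 2:
                fixed[i] = reference
        elif abs(value / reference - 2.0) <= tol:
            # value is about twice its surroundings: assume the second
            # harmonic was picked up and bring it down an octave
            fixed[i] = value / 2.0
    return fixed


# Hypothetical track with one octave jump (248 Hz) and one miss (None)
raw = [120.0, 122.0, 248.0, 126.0, None, 130.0]
print(correct_track(raw))
```

A rule of this kind cannot tell a genuine octave leap lasting a single frame from an error, which is the usual price of smoothing by comparison with neighbours.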
The illustration shows the television screen of the analyser being used
to teach intonation. The model pattern (upper half) like that of the
student's imitation (lower half) can be retained on the screen as long as
desired. Either pattern can be erased at will. The fundamental pitch curve
is produced on the screen in real time, i.e. is traced as the speaker speaks.

The following sections outline the results obtained with the Melodic
Analyser.


Vowels

The analyser responds perfectly to vowel signals, which are the elements
of melodic perception. But it is superior to other analysers in that it always
gives an accurate indication of the fundamental frequency. It is a known
fact – especially with the vowel [u] – that the second harmonic is often
more intense than the first. This explains the erroneous results in the
response of classical analysers when it comes to analysing a sentence like
Vous nous avez tous vus dans la rue?

Figure 13 Spectrogram of ‘Vous nous avez tous vus dans la rue?’


By comparing the curves obtained for this last sentence as analysed by
the spectrograph and as analysed by our apparatus (see below), we see
that the response of the latter is in keeping with the curve obtained on the
spectrograph.

Figure 14 Oscillogram (1), intensity curve (2), and intonation curve (3)
obtained on our melodic analyser for the sentence ‘Vous nous avez tous vus
dans la rue?’

Noise

It is possible to distinguish background noise and consonant noise, both
of which tend to cause erratic responses that interfere with vowel recording.

Background noise

Our instrument, like all others making a partial […], is perturbed
by intense noise. A tape made of a conversation on a Paris street
at a level of 8 db more or less above background noise produces an intonation
curve that is difficult to interpret correctly. Here is an example:

Figure 15 Fragment of a sentence taped at a high noise level. The intonation
curve is perturbed

With less intense noise, and given signals 20 db or more above background
noise, our analyser is particularly resistant to interference. Here is
a recording made in our laboratory at the same time that an adding
machine, fan, etc. were operating:

Figure 16 In spite of the noise (visible on curve 1), the intonation curve (3)
appears clearly
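To give a sense of what the two levels quoted above mean, the sketch below converts between a level difference in decibels and a ratio of amplitudes, using the standard relation of 20 times the base-10 logarithm of the amplitude ratio. The function names are assumptions, and the text does not say how the authors measured their levels.

```python
import math


def level_difference_db(signal_rms: float, noise_rms: float) -> float:
    """Level of a signal above the noise, in decibels, from RMS amplitudes."""
    if signal_rms <= 0 or noise_rms <= 0:
        raise ValueError("RMS amplitudes must be positive")
    return 20.0 * math.log10(signal_rms / noise_rms)


def amplitude_ratio(db: float) -> float:
    """Amplitude ratio corresponding to a level difference in decibels."""
    return 10.0 ** (db / 20.0)


# The street recording: speech about 8 db above the background noise
print(round(amplitude_ratio(8.0), 2))

# The laboratory recording: speech 20 db or more above the noise
print(round(amplitude_ratio(20.0), 2))
```

On this reckoning the street recording has speech only about two and a half times the amplitude of the noise, against ten times or more in the laboratory recording.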


Voiceless consonants

(a) Thus far we find that in initial position, voiceless fricatives do not
cause any deviation from the general melodic curve. The stops, however,
reveal a jump in frequency from 10 to […]

(b) In the intervocalic position, voiceless consonants are characterized by
an average jump in frequency of about 5 to 10 Hz.

(c) In final position, these consonants produce no perturbation of the
curve.

Voiced consonants

(a) In initial position, voiced stops begin at a frequency of 10 to 20 Hz
below the level of the following vowel.
The voiced fricatives show a concave pattern descending from 10 to 15
Hz below the level of the following vowel.
As for the nasals, [m], [n], [ɲ], they remain at the same level as the
vowels. The liquids [l] and [r] begin from 15 to 20 Hz below the vowel
curve.

(b) In intervocalic position, the voiced stops show the same pattern as in
initial position, i.e. concave, from 5 to 10 Hz below the following vowel.
Voiced fricatives [v], [z], [ʒ], more periodic and sustained than the
corresponding stops, are marked by a concave fall from 15 to 20 Hz.
The nasals, being the most periodic of all the consonants, as a rule
blend smoothly in the melodic continuum of the vowels. Sometimes they
are marked, however, by a slight fall from 5 to 10 Hz below the neighbouring
vowels. The liquids [l] and [r] lower the curve, in general, from
5 to 10 Hz.

(c) In final position, the voiced consonants are plainly visible on the
intonation curve. The stops [b], [d], [g], followed by a release or by a true
schwa, show the same characteristic trough. When there is no release, the
curve descends from 15 to 20 Hz before returning to zero. The same holds
true for fricatives.

Final nasals prolong the curve at […] vowels. The [l] behaves like the
[…], voiced or voiceless, […]

In general the consonants […] curve. They are either too […]

Figure 17 shows the analysis of the sentence Je n'ai pas très bien mangé
hier soir, as done by our analyser. The various interruptions of the curve
that have been pointed out are readily visible.


Octave shifts

Analysers currently on the market have to be readjusted each time the
voice shifts octaves. This hampers any study of expressive style, where
speakers generally use an expanded range, as in the sentence analysed
below: Est-ce que c'est beau?

On a scale of 70 to 500 Hz our analyser permits all possible variations,
as the preceding curve demonstrates. This flexibility makes it possible to
analyse expanded ranges without difficulty.


Sudden rises

In the case of a very rapid rise, neither the classical analysers nor the
spectrograph reacts fast enough. In the example below, the same sentence
can be seen analysed on the spectrograph and on our analyser. The
intonation has made an abrupt jump of 200 Hz in 100 ms. The spectrograph
missed the analysis of this part of the curve but our analyser renders
it effectively.

Figure 17 Oscillogram and intonation curve of the sentence ‘Je n'ai pas
très bien mangé hier soir’

Figure 18 […] shows the melodic variations of the expressive […]

Figure 19 Spectrogram of the expressive sentence ‘Bonjour! cher Monsieur.’
The very high note of the first syllable of ‘bonjour’ rises only gradually. This
delay can be evaluated by comparing with Figure 20

Figure 20 Oscillogram and intonation and intensity curves of the sentence
‘Bonjour! cher Monsieur’ on our analyser. This sentence is the same as
that of Figure 19. The sudden rise shows clearly


Precision

The scale of our intonation curve affords great precision. Even by taking
the tenth harmonic, often difficult to read, or by using the ‘scale magnifier’,
a spectrogram falls far short of the precision – down to one hertz – possible
with our analyser.

Figures 13 and 14 show this difference clearly. Note the broad line of
the spectrogram corresponding to the pass band of the narrow band filter
with which one must estimate the frequency. This leads to a considerable
lack of precision. All the slight variations of frequency within a vowel are
lost.

To sum up: if our analyser is less resistant to background noises than
the spectrograph, it is more precise and more sensitive to rapid changes of
pitch, more reliable and more flexible than the other instruments of the
same type currently in use.


References 


James, E. F. (1970), ‘The speech analyser of the University of Toronto’, in
P. R. Léon, G. Faure and A. Rigault (eds.), Prosodic Features Analysis, Didier.
Joos, M. (1948), Acoustic Phonetics, supplement to Language, vol. 24, no. 2.
Potter, R. K., Kopp, G. A., and Green, H. C. (1947), Visible Speech, Bell
Telephone Laboratories Series, Van Nostrand.




Part Two 
Theory 


The articles in Part Two can be divided into two halves. Those in the
first half, by Pike, Trager and Stockwell, are typically American and
represent chronological advances on the same base. The two in the
latter half are not in this line of succession. Crystal's represents the
British tradition, which it summarizes so thoroughly that no other
representative is needed. Bolinger's advances a theory about a part of
the field that has gone untreated up to now.


Kenneth Pike was the first American structuralist to attempt more than
a programmatic treatment of intonation. He undertook, in his book
The Intonation of American English, not only to give a thorough review
of studies of English intonation done in England and America up to
that time, but to examine in a coherent way all the factors – rhythm,
pause, length, and stress, as well as pitch – that combine to make the
prosody of the language. Other American linguists, notably Zellig Harris
and Rulon Wells, held similar views, but with Pike the position of
American structuralism on intonation was pretty well fixed. His approach
still typifies the work done by the far-flung Summer Institute of
Linguistics, and in modified form it is the one adopted by Trager and
Smith (see pp. 83-6 below).

Pike established the ‘level’ approach to intonation. It differs from the
‘contour’ approach chiefly in that it regards relative heights of pitch as
phonemic (that is, they bear the same relationship to intonational
configurations as such phonemic entities as vowels and consonants bear
to words). The term contour is used to refer to configurations, but the
essential component is not the succession of movements (up, down,
etc.) but the succession of levels, with movements being incidental to
getting from one level to another.

Pike argued that intonational meanings are privative to intonation
and are not to be confused with the syntactic uses to which they are
put; he warned against insisting on ‘question intonations’ and
‘statement intonations’. Intonational meanings were to be diligently
abstracted from the meanings of words and syntactic constructions that
occur with them and from their own particular manifestations at a
given place and time. The chapter from Pike's book is given first
because it is first in point of time and because it lays the foundation
with a teacher’s regard for presenting a difficult problem in a
comprehensible way. One caution: the numbering system is the reverse
of the one generally used by Pike’s adaptors. Pitch 1 is highest, pitch 4
is lowest.
Probably the most influential treatment of English intonation has
been that of George L. Trager and Henry Lee Smith, as it appeared in
their 1951 study, An Outline of English Structure (Norman, Oklahoma).
Within a few years it had been adapted to a wide range of grammars
for classroom use – books as dissimilar as the Roberts English Series
(1967) and the Pyles-Algeo English: An Introduction to Language
(1970). It has the great pedagogical merit of lending itself to a notation
that is economical to print and easy to interpret. But it has been just as
influential in its theoretical impact, and has largely been accepted in
generative-transformational treatments of intonation. Trager and Smith
adopted much of the system developed by Pike. Their chief
modification is the elaboration of the role of stress, and the
formalizing of the pitches that occur at pause points. Where Pike marks
a tentative pause and a final pause, with pitch behaviour conditioned
by them (before a final pause, for instance, a pitch 4 will ‘tend to fade
into silence while drifting downward’ – see p. 70), Trager and Smith
substitute terminal junctures (in Trager’s article called contours), which
are the intonational movements that punctuate the end of an intonation
pattern; they are regarded as elements of the prosody in their own
right: rise, fall and sustain. As with Pike, stress is regarded as an
independent, but interacting, system, which is to say that if a rise in
pitch and a perceived prominence (what we think of as ‘loudness’)
occur at the same time, the pitch change is incidental to the stress. The
study by Trager is an updating of the intonational part of the Outline.

Until very recently intonation was the chief item of unfinished
business for the currently most vigorous approach to linguistics, that of
generative-transformational grammar. An early exception was the
article by Robert P. Stockwell, referred to in his new article written
especially for this volume. Here he summarizes his former conclusions
and goes on to review some recent work in the field which has been
aimed at modifying the views on accent that were expressed in The
Sound Pattern of English by Halle and Chomsky (1968). The reader
will see how closely the latter still interlocks with Pike and
Trager-Smith.

The British tradition is less revolutionary than the American.
Though both build on their own past, instead of successive revisions
we find a continual broadening of essentially the same base. This is to
be expected given the empirical and highly practical aims that have
prompted the study of intonation in Britain. Large amounts of data
have been examined – in the most recent studies great quantities have
been assembled for the purpose – and systems have been strongly
influenced by the need to publish materials for teaching English as a
foreign language. David Crystal has made the most comprehensive study
of English intonation to date. His work, the main chapter of which is
reproduced here, is the best synthesis of the British approach, in which
the concepts of ‘nucleus’, ‘tune’ and ‘tone group’ figure prominently.

Most descriptions of intonation have been at one or the other of
the two extremes: atomistic or global. The atomistic description is one
that looks for meaningless subunits which bear the same relationship
to intonation that segmental phonemes bear to words; this is essentially
the ‘level’ approach, with each level corresponding to a phoneme. The
global description describes entire contours, giving their grammatical or
attitudinal meanings; this is the ‘tune’ approach. The last article in this
Part, by Bolinger, takes a different tack: it adopts pitch directions as
its units, which it assumes resemble gestures in the way they convey
meaning, and looks at how the directions are combined and how they
affect the parts of the sentence that are accented and the parts that are
not. The latter – the unaccented syllables – have been pretty much
ignored up to now, and in some treatments have been considered not
to count at all.


References 
Halle, M., and Chomsky, N. (1968), The Sound Pattern of English,
Massachusetts Institute of Technology.
Pyles, T., and Algeo, J. (1970), English: An Introduction to Language,
Harcourt Brace Jovanovich.
Roberts, P. (1967), The Roberts English Series: A Linguistics Program,
Harcourt Brace Jovanovich.


3 Kenneth L. Pike 


General Characteristics of Intonation 


from Kenneth L. Pike, The Intonation of American English, University of
Michigan Press, 1945, pp. 20-41.


Constituted by sequences of pitches – intonation contours


Every sentence, every word, every syllable, is given some pitch when it is
spoken. Even a sound in isolation is produced by vibrations whose
frequencies constitute its pitch. There are no pitchless sentences.

Fluctuation in pitch occurs in the sentences of all languages. No language
uses a pure monotone. Once a person trains himself to listen for pitch in
speech he notices considerable fluctuation even in the voices of persons
reputed to be monotones.

The changes of pitch which occur within a sentence are not haphazard
variation. The patterns of variation, the rules of change, are highly
organized. Their intricacy is so great that, although one speaks his
language with little effort, their analysis is extremely difficult and may induce
one to conclude that no actual organization or rules are present, but that
people use pitches by whim and fancy. In each language, however, the use
of pitch fluctuation tends to become semi-standardized, or formalized, so
that all speakers of the language use basic pitch sequences in similar ways
under similar circumstances. These abstracted characteristic sentence
melodies may be called intonation contours.

Intonation characteristics may be roughly divided into several types.
Some contours may be completely colorless in meaning: they give to the
listener no implication of the speaker's attitude or feeling. Since sentences
must be spoken with pitch, and pitch sequences become formalized, these
meaningless intonation contours represent the intonational minimum of
speech. They serve a mechanical function – they provide a mold into which
all sentences may be poured so that they achieve utterance. Nevertheless,
these mechanical contours may be very important for learning a language,
since failure to use them would immediately label a speaker as a foreigner
with a bad accent and hamper his freedom of style.

Other intonation characteristics may be affected or caused by the
individual's physiological state – anger, happiness, excitement, age, sex,
and so on. These help one to identify people and to ascertain how they are
feeling (unless, along with a ‘poker face’, they have a ‘poker voice’ which
does not reveal these facts, or departs from the anticipated norm in some
way).

In English, many intonation contours are explicit in meaning. Whenever 
a certain sequence of relative pitches is heard, one concludes that the 
speaker means certain things over and above the specific meanings of the 
words themselves. A change of pitch contour will change the meaning of 
the sentence: thus, horse? and horse! are different.

A single contour is not necessarily exactly as long as a sentence. One
sentence may have several contours, and a single contour may have
several meaningful parts. This analysis will be demonstrated presently,
but first, more detail will be given about problems of shades of meanings
in the analysis of intonation contours.


Accompanied by shades of meaning
Contrasting pronunciations as evidence for different meanings 


Whenever an investigator finds a language in which a specific sentence 
can be pronounced in two, three, four, or more ways, he must investigate 
the reason for the different pronunciations. The different pitch sequences 
probably imply a changed relation of the speaker to the sentence, or of the 
sentence to its environment. It is improbable that much fluctuation will 
occur without an accompanying change of meaning. Languages which 
have mechanical intonation contours rather than meaningful ones would 
appear to have relatively little fluctuation: for example, Oto (an Indian 
language of Oklahoma) has a mechanical pitch contour in which stressed 
syllables of normal words have high pitch and unstressed ones lower 
pitch – and these relative pitch relationships seem not to be upset by
emotional contexts; Oto has a few interjections, however, which can have
one of several different pitch pronunciations, and these used in the proper
context indicate the emotion or attitude of the speaker. In contrast to
Oto, anyone who chooses to do so can pronounce in a dozen ways an
English sentence such as I am going to town today (with surprise, exclamation,
query or emphasis on different words); one must not assume that
other languages are like English in intonation.


Intonation meanings superimposed upon lexical meanings 
(speaker's attitude) 


English words have basic, intrinsic meanings; these lexical meanings are
the ones found in the dictionary. Frequently, the lexical meanings are very
objective; for example, horse refers to an animal with four legs, solid
hooves, and a flowing mane and tail. Sometimes the lexical meanings are
less objective: for example, try does not refer to any single specific act, but
rather to the undertaking of some task by choice. A word may have
several lexical meanings: horse may refer to a mare, Percheron, supporting
frame, knight (in chess), apparatus for vaulting, and so on; try may mean 
to make trial of, to experiment with, to afflict. When several meanings are 
possible to the one word, the particular meaning must be chosen which is 
pertinent (‘makes sense’) to the particular context in hand. Sometimes the 
context demands an interpretation in terms of metaphor or irony — or even 
falsehood. Nevertheless, all of the lexical meanings have this in common, 
that they are indicated only by the requisite consonants, vowels and stress, 
and a context where such a meaning is possible; in that sense, the lexical 
meaning is intrinsically a part of the word itself and not dependent upon 
extraneous phenomena such as pitch produced by emotion. 

The intonation meaning is quite the opposite. Rather than being a stable 
inherent part of words, it is a temporary addition to their basic form and 
meaning. Rather than being carried by permanent consonants and vowels, 
it is carried by a transitory extrinsic pitch contour. Rather than contributing
to the intrinsic meaning of a word, it is merely a shade of meaning
added to or superimposed upon that intrinsic lexical meaning, according
to the attitude of the speaker. Thus, to horse, may be added a pitch scheme
indicating the speaker's surprise – i.e. a horse! (or the meaning could be
given roughly in lexical form as look at the horse about which I am quite
surprised at its unexpected appearance). In English, then, an intonation
meaning modifies the lexical meaning of a sentence by adding to it the
speaker's attitude toward the contents of that sentence (or an indication
of the attitude with which the speaker expects the hearer to react). (See
also, for further discussion of this point, pp. 57-9.)


Difficulty of isolating an intonation contour for analysis of its meaning 

In order to study his own intonation, a speaker needs to be able to repeat
a sentence a number of times using substantially the same pitches each
time, so as to compare the utterances and later study the effect of deliberate
changes or substitutions in various parts of the sentence. Such repetition
is difficult; the pitches appear elusive and ephemeral, and considerable
practice is necessary before it can be done easily. The following imaginary
anecdote will illustrate the problem: Paul was studying intonation, and
noting any new contours which he heard at odd moments. One afternoon
he said, very impatiently, John, tell Mary that she has forgotten to go to
the store; she will have to hurry to get there before it closes. Paul noticed in
his own speech something which he had not recorded previously; so he
repeated the sentence for analysis. In turning to research, however, his
impatience disappeared and he became introspective. In abandoning his
impatience, he automatically dropped his impatient intonation contours,
and, in becoming introspective, automatically substituted introspective
intonation contours with slow forms, deliberate utterance, and resultant
additional pauses and glides. Upon noticing these changes, Paul attempted
to utter the sentence as he had done originally, and felt foolish since the
simulated emotion of the intonation contours was not paralleled by
actual emotion. Persisting in repetition, Paul suddenly could not be sure
that he was repeating accurately, since the sentence now appeared
somewhat queer and somewhat plausible simultaneously.

A phonographic or magnetic recording preserves a sentence without
change, and is a decided help to analysis. There are difficulties involved,
however: for best results, one must be able to hear a single sentence – not
long paragraphs – repeated immediately, and this may be awkward to
achieve. Further, once a faithful mechanical repetition is obtained, the
normal non-significant variation of speech is lost, and attention is likely to
become focused on details which are not semantically of importance
even for shades of meaning; a phonetic transcription of these non-significant
details obscures the picture of the actual systematic organization of
the contours.


A musician has some advantage in hearing speech pitch, but must be 
careful not to lose all that value by falling into the error of trying to record 
absolute pitches and fixed intervals rather than relative phonemic pitch 
contrasts in which one pitch is higher than a second and so on, but neither
is essentially related to any standard number of vibrations per second.


Strength of meanings 


An extraordinary characteristic of intonation contours is the tremendous
connotative power of their elusive meanings. One might hastily and
erroneously assume that forms which change so rapidly and automatically
could not be semantically potent. Actually, we often react more violently
to the intonational meanings than to the lexical ones; if a man’s tone of
voice belies his words, we immediately assume that the intonation more
faithfully reflects his true linguistic intentions. Thus if someone says, Is
breakfast ready yet? the sentence is either innocuous or an insult according
to whether it is spoken nicely or nastily – and if the insult is resented, the
speaker defends himself by saying, I just asked if breakfast were ready and
she flew into a rage. This illustrates the fact that the intonation contours,
though fluctuating like the speaker’s attitude, are as strong in their
implications as the attitudes which they represent; in actual speech, the
hearer is frequently more interested in the speaker’s attitude than in his
words – that is, whether a sentence is ‘spoken with a smile’ or with a sneer.

Usually the speaker's attitude is in balance with the words he chooses.
If he says something mean, his attitude usually reflects the same
characteristic. Various types of word play, however, depend for their success
upon the exact opposite, that is, a lack of balance between content and
intention or attitude. If one says something insulting, but smiles in face
and voice, the utterance may be a great compliment; but if one says
something very complimentary, but with an intonation of contempt, the
result is an insult. A highly forceful or exciting statement in a very
matter-of-fact intonation may, by its lack of balance, produce one type of irony.
Lack of balance between intonation and word content may be deliberate
for special speech effects.


Principles and dangers in definitions of meanings 


Once a particular intonation contour has been isolated, its meaning is 
determined by finding the least common denominator of the linguistic 
contexts or physical and emotional situations within which that contour 
occurs. If, for example, a low slightly rising contour occurs in utterances
which are variously statements, queries, dependent clauses, and also occurs
in the discussion of trees, children, algebra, atoms and cancer, while in
each utterance the speaker is deliberating carefully on these items, then it
is precisely the speaker’s attitude of deliberation which constitutes the only
contextual characteristic common to all of them. In this case, the low,
slightly rising intonation contour must be defined as meaning a deliberate
attitude of the speaker. As with words which may have two or more
related lexical meanings, however, so with intonation contours one must
sometimes indicate a central meaning with marginal variations from it.
For English, meanings of intonation contours are largely of this general
type – attitudes of the speaker (or, occasionally, imputed by the speaker to
the hearer). Most sentences or parts of sentences can be pronounced with
several different intonation contours, according to the speaker’s momentary
feeling about the subject matter. These attitudes can vary from surprise,
to deliberation, to sharp isolation of some part of a sentence for attention,
to mild intellectual detachment. The lexical meanings and intonational
meanings may coincide, as when one uses a deliberative intonation contour
while saying the words I’m still thinking about it. Or, as has already been
shown, the words and intonation may be voluntarily placed in conflict for
facetious purposes.
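
The ‘least common denominator’ procedure described above can be pictured as a set intersection. The following Python sketch is only an illustration under stated assumptions: the feature labels and the three sample observations are invented for the example and are not data from the original study.

```python
def common_meaning(contexts):
    """Return the features shared by every context in which a contour occurs."""
    feature_sets = [set(c) for c in contexts]
    if not feature_sets:
        return set()
    return set.intersection(*feature_sets)


# Hypothetical observations of one low, slightly rising contour.
observations = [
    {"statement", "topic:trees", "deliberation"},
    {"query", "topic:algebra", "deliberation"},
    {"dependent clause", "topic:atoms", "deliberation"},
]

# Across diverse contexts only the speaker's attitude survives.
print(common_meaning(observations))

# With a single context every local feature survives, so the
# apparent meaning looks more concrete than it really is.
print(common_meaning(observations[:1]))
```

The second call mirrors the danger of sampling too few contexts: grammatical and topical features of the one context are mistaken for part of the contour's meaning.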
In analysing the meanings of intonation contours the chief danger of
error – an error which has vitiated much work in the past – lies in the
failure to get the common meaning from a large enough number of
contexts. By abstracting the meaning of a particular contour just from a
single context, or from contexts which are all grammatically or physically
similar even although that contour actually occurs elsewhere in grammatically
and physically diverse contexts, one tends to assume that the
meaning is much more concrete than it actually is; this takes place when
one includes in the definition of a contour the characteristics of the local
context selected, whereas these characteristics would not universally
appear with that contour if the sampling had been wider. Of these errors,
the easiest to commit is to select phrases of a particular grammatical
construction, demonstrate that a certain contour may appear on all of
those phrases, and then claim that the contour in question means or
indicates that grammatical pattern – in spite of available evidence that
that contour could appear on other grammatical phrase types, or that
the phrases used could receive any of a dozen other contours. In an
attempt to escape the consequences of such a method, without abandoning
it, one may try to define a contour several times over, first in one selected
set of similar phrases, and then in another set, and so on; this can prove
helpful as an intermediate step, but only if one afterwards carefully compares
the various definitions to find the common item of meaning which
is basic to them all, and then discards the characteristics limited to selected
contexts and uses the universal meaning as the definition for all occurrences.
Apart from such a procedure, the use of too restricted a context
leads to great complexity by inducing multiple definition of contours,
with a welter of rules for the types of contexts in which they occur; this
is quite undesirable, since much of the complexity of rules postulated in
this manner involves grammatical facts which for English have no innate
participation in the meaning of the contours themselves.
The intonation system of English is decidedly intricate, and at best the [...]
involved in overlapping phenomena. In such a [...]


altogether, if they are rarely encountered in speech and are difficult to
classify under the limited melodies set up as standard. Over-simplification
of a different type – possibly combined with the preceding one – may
consist in an attempt at predicting occurrence of contours by a grammatical
rule-of-thumb. For example, popular non-linguistic tradition would seem
to claim that there is a question pitch as distinct from a statement pitch;
all questions are presumed to use the first of these two, and, as a corollary,
the question pitch would not occur on statements. The evidence fails to
support the assumption. There are many more contours than one for
question and one for statement. Specifically, it was a marked surprise to
me to find that there are many different contours which can be used on
questions, and that for any contour used on a question I could usually
find the same one used on a statement; likewise, for all – or nearly all –
contours used on statements, I found the same ones used on questions.
In other words, there appeared to be no question pitch as such. This type
of evidence is responsible for the necessity of abandoning grammatical or
lexical definition of contours; definition in terms of attitudes of the speaker
has been utilized, instead, in this study.

Further problems in determining meanings may be mentioned briefly. 
The intonation contour may cover part of a sentence or a whole sentence; 
it is important to find in the sentences the key places which are most 
crucial to the formation of a meaningful contour. Furthermore, various 
types of intonation, such as the general pitch of the voice as a whole in 
contrast to the different pitches occurring within a single sentence, must
be studied separately in so far as is possible. For instrumental studies of
pitch, both of these cautions must be exercised, or measurements will be
made of lists of items which are linguistically non-significant and not
uniform; an instrumental analysis for linguistic purposes needs to be
preceded by an analysis of contrasts of intonation which in turn demands
careful attention to the characteristics which carry or control meanings.


Distributed over phrases
An intonation contour is not limited to specific syllables or words, but
may be spread out over as many syllables and words as are colored by the
speaker's attitude. For example, an intonation contour which begins
low and rises slowly could be spread over three syllables, as in (He’s) doing
it? or the entire rising contour may occur on a single syllable such as Tom?
When a phrase becomes quite long, the contour may be subdivided, since
a long contour is somewhat awkward to pronounce; sometimes contours
may be spread over long sequences of syllables without being subdivided;
at other times the stresses and arrangement of words cause even a
five-syllable phrase to be subdivided.

When a contour occurs on a single syllable, a glide is
formed (see also p. 65), so that the entire contour may be actualized within
that syllable, as in Tom? (contour °4-1). When a falling or rising contour
is spread over a number of syllables, the pitches tend to be fairly level on
each syllable, but the rise or fall is accomplished by steps of pitch so that
the pitch of one syllable is higher than the pitch of one preceding or following
it, as in ticker? (contour °4-1), and Did you want him to buy it? (with
the same contour, but heavy stress and low pitch on want, and high pitch
on it).


Compared to the tone of tone languages 


The two most deep-seated characteristics of intonation are (a) the dis- 
tribution of its contours over phrases, and (b) the addition of shades of 
meaning to phrases rather than the giving of lexical meaning to words. 
Both of these characteristics can be seen in contrast with a different type 
of pitch system in tone languages. 

In a tone language the pitch of each syllable is basic to the word. Pitch 
contours are located on single syllables, not on groups of syllables. Every 
syllable has a pitch which is determined by the innate nature of the word 
itself (or occasionally by the morphology or by tone sandhi); no difference 
is observed in this principle whether the tone language has a tendency like 
Chinese toward monosyllabic morphemes and simple morphology, or
like Mixtec (of Mexico) toward dissyllabic morphemes, or like Navajo
(of USA) toward morphemes which may be part of a syllable – often a
single consonant – or entire syllables in an extremely complex morphology.


Further, the tones of tone languages, with the consonants and vowels,
form the actual words themselves so that no word exists unless its phonemic 
tone exists along with its sounds. As part of the innate structure of the 
word, the tone contributes its share toward carrying the basic lexical 
meanings of words. Just as the substitution of [m] for [b] can change
English bat to mat and change the lexical meaning from a ‘club used in
baseball’ to a ‘fabric of plaited straw’, so in Mixtec¹ the substitution of
[t] for [ž] can change žūkū ‘mountain’ to tūkū ‘different’, and the substitution
of [`] (a low tone) for [¯] (a medium tone) can change žūkū ‘mountain’
to žūkù ‘brush’, while žūkú (with one high tone) is ‘yoke’ (a Spanish loan
from yugo) and Züki is ‘non-domesticated’. Thus, the problem of defining
meanings in a tone language is that of defining the lexical meaning of words –
not first defining the lexical meanings as carried by vowels and consonants
and then defining a shade of meaning added by superimposed pitch.
In addition to their lexical pitch, however, tone languages may have
various types of pitches superimposed upon them. Thus, the general pitch
of the voice may carry implications of anger, disgust, [...] and so on (for
example, the Mixtec men occasionally run into falsetto in angry protest


Divided into parts


In order to describe an intonation contour it does not suffice to say that
it is rising, or falling, or falling-rising. Even the simplest rise has a complex
series of relationships to other contours, and complex internal structure.
The size of the interval between beginning and ending points, the height
of the beginning point relative to the general pitch level of the sentence,
paragraph, conversation, or speaker's norm, the relation to timing,
phrasing, stress and pause – these and other characteristics need to be
described for the complete understanding of any contour.

1. For an analysis of the tones of Mixtec, and a procedure for the analysis of tone
languages, see Pike (1948). For an analysis of Mixtec grammar, see Pike (1944). These
investigations of Mixtec were conducted under the auspices of the Summer Institute
of Linguistics during annual field trips from 1935 to 1952.


Four relative levels at contour points 
The pitches of intonation are relative. The absolute pitch of a syllable –
the number of vibrations per second – has no significance as such. The
significance of pitches is determined by their height relative to one another.
If in the phrase John came here, a speaker gives 400 vibrations per second
to came, and 200 vibrations per second to here, then came may be high in
relation to here; but if came has 400 and here has 800 vibrations per second,
then came is low in relation to here; that is, highness or lowness or intermediate
stages of pitch are determined by the proportionate relation of
syllables or phrases one to another, and not by their exact physical
measurement.
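
The John came here example can be worked through numerically. The sketch below is a minimal illustration, not part of the original analysis: it classifies the first pitch relative to the second and, as an added assumption, expresses the proportion in semitones (twelve times the base-2 logarithm of the frequency ratio), a unit the text itself does not use.

```python
import math


def relation(f_first, f_second):
    """Classify the first pitch relative to the second.

    Returns a label and the interval in semitones (an illustrative
    unit chosen here; the source speaks only of proportion).
    """
    semitones = 12 * math.log2(f_first / f_second)
    if f_first > f_second:
        label = "high"
    elif f_second > f_first:
        label = "low"
    else:
        label = "level"
    return label, round(semitones, 1)


# 'came' at 400 vibrations per second in both cases; only 'here' changes.
print(relation(400, 200))  # ('high', 12.0)
print(relation(400, 800))  # ('low', -12.0)
```

The same 400 vibrations per second counts as high in one phrase and low in the other, which is the point of the passage: the label depends on the relation between syllables, not on the absolute figure.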

In English, four relative but significant levels (pitch phonemes) can be 
found which serve as the basic building blocks for intonation contours. 
These four levels may, for convenience, be labelled extra-high, high, mid 
and low respectively, and may be numbered from one to four beginning 
with the one which is extra-high; a fall from high to low would be a change
from pitch level two to pitch level four. 
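
The numbering convention can be made concrete with a small sketch. The names `PIKE_LEVELS`, `describe` and `reverse_numbering` are hypothetical helpers introduced only for illustration; the conversion assumes that the reverse scheme used by later adaptors simply runs from 1 (lowest) to 4 (highest).

```python
# Level numbers as used in this chapter: 1 is the highest pitch.
PIKE_LEVELS = {1: "extra-high", 2: "high", 3: "mid", 4: "low"}


def describe(contour):
    """Describe a contour given as a sequence of level numbers, e.g. (2, 4)."""
    start, end = contour[0], contour[-1]
    # A larger number means a lower pitch, so 2 followed by 4 is a fall.
    if end > start:
        movement = "fall"
    elif start > end:
        movement = "rise"
    else:
        movement = "level"
    return f"{movement} from {PIKE_LEVELS[start]} to {PIKE_LEVELS[end]}"


def reverse_numbering(level):
    """Convert between this numbering and the reverse one (assumed 1 = lowest)."""
    return 5 - level


print(describe((2, 4)))      # fall from high to low
print(describe((4, 1)))      # rise from low to extra-high
print(reverse_numbering(1))  # 4
```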


This number is not an arbitrary one. A description in terms of three
levels could not distinguish many of the contours – for example, the three
contours beginning on low pitch and each rising to a different height. A
description in terms of five or six levels would leave many theoretically
possible contrastive combinations of pitches unused. The four levels are
enough to provide for the writing and distinguishing of all of the contours
which have differences of meaning so far discovered, provided that
additional symbols are used for stress, quantity, pause, general height of
the voice, general quality of the voice, and so on. In this paper, the contours
dependent upon the four levels will first be described and then a brief
description will be given of some of the further modifying speech
characteristics.

The distance between the four levels of English is not mathematically
fixed, uniform or predictable. It varies from individual to individual, and
the individual varies his own intervals from time to time. For general
purposes, and until instrumental studies can determine the average spread
of intervals and their fluctuation, one may assume that the intervals
indicated by the symbols in this paper are more or less equally distributed
between high and low.


The pitch levels appear to be nearly or completely meaningless by them- 
selves. It is the intonation contour as a whole which carries the meaning 
while the pitch levels contribute end points, beginning points, or direction- 
change points to the contours — and as such are basic building blocks 
which contribute to the contours and hence contribute to the meaning. 

Nevertheless, some generalization of usage can be made: there is a tendency 
for pitch contours which include a pitch of level number one (except for 
contours °1-2 and °2-1) to contain some element of surprise or unexpected- 
ness; pitch two is possibly the most frequent level for normal stressed 
syllables, while pitch four is frequent for unstressed syllables at the end of
falling contours, and pitch three for unstressed syllables elsewhere. These
latter generalizations are suggestive as a mnemonic device, but have little
technical validity, because a mass of exceptions (such as contour °3-2
with stressed 3, and °1-2 with unstressed two) indicates that they do not
reflect basic intonational organization. A more legitimate and effective
generalization can be made by gathering into groups those contours which 
have related form (for example, contours falling to pitch four) and meaning 
(for example, mild versus intense contrast or pointing). Once these groups 
are established, some interrela 
postulation of meanings for 

meaning of a contour falling 
meaning of contours rising to 
In determining the pertine: 


beginning and the pitch leve 
then rises (or, very rarely, 
| contour point is always рге: 
> movement changes. In the 
(the numbers will be placed approximately under them). 


! or falling contour, two con 


Tom!   Tom!?   Tommy!   Tommy!?   Margaret!?
2-4    2-4-3   2- -4    2-4-3     2- -4-3


iphone number! telephone number Hi 
In the first five samples each syllable had at least one contour point, so
that the relative pitch of each syllable was important to the establishing
of contours and their meanings. In the last two samples, there were more
syllables than contour points. These extra syllables can be pronounced
with intermediate pitches in a general descending scale, or with considerable
variation in the amount of drop from syllable to syllable. There may be
many more than four actual levels, but it is the contour-point levels which
are pertinent to the system. Compare the differences in the following
utterances, all of which have the same contour points (except that an
immediate versus delayed drop or rise in pitch is occasionally significant):


telephone number! 
2 3+3 4+4 
Of) 3) з 3 4 
on? 3 4 44 
DESS 4 4 4 d 


The contour points of primary intonations 

It is at the ends of sentences that contours with the strongest meanings 
tend to occur. For example, many different contours, including 2-4, 1-4, 
2-4-3, 3-2 can be given on the last word of I want to go home; usually their 
meanings will be stronger or more prominent than the meanings of 
additional contours occurring earlier in the sentence. (If the last words
have their lexical stresses partially suppressed, and the first word receives
a sentence stress, no additional contours are added, but the size of the
contour is increased, and its placement on an example like I want to go
home is changed.) These important contours which frequently appear at
the end of sentences may for convenience be called primary types; included
in this classification, also, are all other contours which are similar in
structure, even though they may rarely occur in sentence-final position;
furthermore, these same contours are still called primary when they appear
earlier in the sentences instead of at their ends. Their structure will be
described in the paragraphs which follow.

A stressed syllable constitutes the beginning point for every primary
contour; there is no primary contour without a stressed syllable, and every
heavily stressed syllable begins a new contour. In the following illustration
there are five stresses and five primary contours; the beginning of the
primary contour will be shown by the degree sign [°] before the number of
the pitch level:

The 'boy in the 'house is 'eating 'peanuts 'rapidly.
3- 755 3- 95:3 igs 6223 (253 SE 

The syllable which receives the stress is usually one which would
normally be stressed anyhow, that is, is lexically determined, like the first
syllable of table, and company, or the second syllable of regard, and
receive, or the third syllable of implications. For polysyllabic words, the
place of normal lexical stress may be determined according to their
pronunciation by themselves, that is, in isolation.

All monosyllabic words when pronounced by themselves have a stress
and a primary contour. However, it is inconvenient to consider the isolated
form of the word as the most basic one. A simpler statement of the stresses
of the language as a whole is obtained when one assumes that the most
basic pronunciation of the stress or lack of stress of a monosyllabic word
is that which is found in a phrase of the normal type. By normal type, in
this instance, is meant a phrase which does not suppress the regular lexical
stress of any of its dissyllabic nouns, adjectives, main verbs, and the like,
nor add special sentence stresses to those syllables which are usually
unstressed. Thus, The teacher is coming is not a normal phrase, because
the stress is partially suppressed on the word coming. Likewise, certain
monosyllables are stressed in normal phrases, but others are without
strong stress there. Thus, the phrase The 'boy is 'coming is a normal phrase,
because the stresses are regularly retained on boy and coming, but the
phrase The boy 'is coming is an abnormal one because the stresses are
partially suppressed on boy and coming and one is added to the word is.
Those syllables which regularly receive stress in dissyllabic words, and
those monosyllables which regularly receive stress in normal phrases, may
conveniently be said to be innately stressed, even though these stresses can
be partially suppressed. In general, those monosyllables which are innately
stressed are the nouns, main verbs (i.e. not in auxiliary position before
other verbs), adjectives, interjections, indefinite pronouns, demonstratives,
interrogatives, and adverbs of time, place and manner. Those
monosyllables which tend to be innately without stress include the personal and
reflexive pronouns, auxiliary verbs and adverbs of degree.

Any syllable which is innately stressed is potentially the normal begin- 
ning point of a primary contour. 
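
The classification just given can be sketched as a small lookup in Python. The word-class labels, the set names and the function `potential_contour_starts` are all assumptions introduced for this illustration, and treating any class not named in the text (such as the article) as unstressed is a simplification of this sketch, not a claim of the analysis.

```python
# Word classes named in the text as innately stressed (for monosyllables).
INNATELY_STRESSED = {
    "noun", "main verb", "adjective", "interjection", "indefinite pronoun",
    "demonstrative", "interrogative",
    "adverb of time", "adverb of place", "adverb of manner",
}

# Word classes named in the text as tending to be innately without stress.
INNATELY_UNSTRESSED = {
    "personal pronoun", "reflexive pronoun", "auxiliary verb",
    "adverb of degree",
}


def potential_contour_starts(tagged_words):
    """Return the words that could normally begin a primary contour."""
    return [word for word, cls in tagged_words if cls in INNATELY_STRESSED]


# The normal phrase discussed above, with hand-assigned word classes.
phrase = [
    ("the", "article"),        # class not listed in the text; assumed unstressed
    ("boy", "noun"),
    ("is", "auxiliary verb"),
    ("coming", "main verb"),
]
print(potential_contour_starts(phrase))  # ['boy', 'coming']
```

The result agrees with the normal phrase cited above, in which the stresses are retained on boy and coming.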

If a single contour is spread over several words, each with innate
stresses, only one of the syllables is permitted to be very prominent. This
extra prominence comes (1) from optional added intensity on the syllable
at the beginning point of the contour and (2) from obligatory lessened
intensity on the remaining innate-stressed syllables of the contour.
(However, the reduced stresses usually remain somewhat more intense than
those syllables which have no innate stresses at all. A stress which is
retained or added will be marked ['] if it is fairly strong or ["] if it is intensely
and emphatically strong.) This effect may be heightened by the pitch of
the prominent syllables. Notice the syllables lessened or heightened in
prominence in the pronunciation of the following illustrations (as above,
the beginning of the primary contour will be shown by the degree sign [°]
before the number of the pitch level):


He's 'coming today. 
°2- -4 
— He "wanted to buy it (but 'couldn't). 
Í n -4- -3 4- 92-4 


He "wanted to invest in securities (but 'couldn't). 
°1- gua E 


(In the preceding set of illustrations, the words today, buy, invest and
securities have had their syllables with innate stress lessened in prominence,
whereas wanted had its innate stress made even more prominent by
emphatic stress.)

For special effects the beginning point of a primary contour may be
placed on a syllable which is not innately stressed, giving stress and
prominence to that syllable while the normal innate stress and prominence
is removed from other syllables. Note the following sentence:


I didn't say 'unvestigate, I said "investigate; the word normally is investigate.
4. 43 т A SETS 


An ending point completes a primary contour. If the entire contour
occurs on a single syllable, the ending point is constituted by the second
half of that syllable, with a pitch glide connecting the beginning and
ending points. This is true for utterances such as

boy,    why?    gone!?
°2-4    °3-1    °1-4-3


The ending point may be an entire unstressed syllable, as in

ticket?   happy;
°3-2      °2-4

or part of an unstressed syllable, as in

ticket?   happy,   investigate;
°2--4-3   °2-4-3   °2- -(3)-1

or part of a stressed syllable, as in

There's the man.
°2-        -4-3

If the contour is composed of two or more syllables, as in

ticket?
°3-2

there is a step up or down between them.


A direction-change point occurs at the center of a small but important
minority of the primary contours, when the pitch changes from falling
to rising or (rarely) rising to falling. This point may occur on the central
part of a single syllable, or the first part of a stressed or unstressed syllable,
or on a complete unstressed syllable; compare pitch four in

Tom,     ticket,   syllable.
°2-4-3   °2-4-3    °2--4-3


The central part of a contour may be relatively level, so that the change
point optionally occupies one or more syllables, as in


The end of a primary contour usually coincides with the end of some
word, as in


- 9-4-3 92-4 


The end of the contour, however, does not have to occur on the same
word with which it began – as the first illustration in the preceding series
demonstrates.

The end of every word is potentially the ending of a primary contour. A
contour which contains several words may, under special conditions of
attention, emphasis, and the like, be broken up into as many contours as
there are words. Compare the following set of illustrations; more samples
could have been constructed from the same sentence:
He wanted to do it.
°2-            -4

He wanted to do it.
°2- -4    °2-  -4

He wanted to do it.
°2-4      °2-  -4

He wanted to do it.
°2-4 °2- -4 °2-4 °2-4


A primary contour may end in the middle of a word under special
emphatic stress, or if the word has two innate stresses, as follows:

'in"adequate
°2-3 °1- -4

Primary contours frequently begin in the middle of words, as in in'sipid 
°2-4 
(For the part played by the beginnings of words in the establishing of 
intonation boundaries, see precontours [pp. 67-8] and rhythm units [pp. 
72-6 and pp. 79-81].) 


Precontours within total contours 

Immediately preceding the stressed syllable of a primary contour there
oftentimes will be one or more syllables which are pronounced in the same
burst of speed with that primary contour but which themselves are
unstressed. These syllables may be called precontours, and depend for
their pronunciation upon the syllables which follow them. They may
constitute grammatically independent words, like a, he and under, or they
may be parts of a word, as in re(ceive) and invo(cation). In innate structure
they may be lexically without stress, or their innate lexical stresses may
have been partially suppressed. Notice the difference of grammatical and
lexical type in the precontours of the following illustrations:


the ‘boy 
3- °2-4 


He 'said so, 
gren 24 


an interesting "house (– "not an interesting "barn)
3- RÉI °2-44- °2-4 


The different precontours have meanings, but in general their implication
of the speaker's attitudes is not so strong as that of the primary
contours. As an illustration of distinctive precontour significance, notice
that of the two following utterances the second portrays a much more
insistent attitude than the first:

I want to go home.
             °2-4

I want to go home.
             °2-4

A primary contour with its unstressed precontour knit closely to it in
pronunciation forms a single intonational unit, a total contour. In the
illustration immediately above, the intonation of I want to go is a
precontour, that of home is a primary contour, and the entire pitch sequence
is a total contour. If a primary contour happens to have no precontour,


it constitutes a total contour by itself. Thus, both of the following items
are total contours:

'Tom did it.
°2-     -4

The 'man did it.
3-  °2-     -4

In succeeding examples, numerals which symbolize key points in a single
total contour will be connected by hyphens; the precontour numeral will
be followed by a hyphen, and parts of the primary contour will be joined
by hyphens also, as in the first of the following samples; the second sample
contains two total contours:

Good morning Tom.
3- °2-4-3

The doctor bought a car.
3- °2-4-3 4- °2-4

Precontours usually begin coincidentally with the beginnings of words,
as in the man.
      3- °2-4

Related to pause and rhythm

Intonation contours are intimately related to pauses and to rhythm, as
this section will demonstrate. Nevertheless, intonation must be kept
distinct from these latter speech characteristics, since in many respects
they are independent one of another. Pause and rhythm are closely
dependent upon one another in some of their elements and usage, but in
other ways are independent, and so must be handled as separate
significant entities (that is, as different types of phonemes or morphemes; for
a summary of these differences, in operation and contrast, see p. 82).

Pauses (tentative and final)

When a person makes a cessation of speech, there is a pause. There are
two significant types of pause (i.e. two pause phonemes or morphemes):
a tentative one and a final one; these may be symbolized by a single and
double bar, [/] and [//], respectively, and have the meanings indicated by
their labels.

Either pause type may vary in length; the tentative pause is usually
shorter in length than the final one, but it is not always so.

The tentative pause has one very important alternate form: instead of
a gap in the speech, a complete cessation, there may be a lengthening of
the last sound or two of the preceding word. This length takes up the same
time as the physical pause would have done; there is no confusion with
the normal sounds which are relatively long, nor even with lengthening
for emphasis, since the elongation for the equivalent of pause is
accompanied by a considerable weakening of the strength of the sounds, and it
is this weakness of sound plus the length which can substitute for physical
pause in the tentative pause phoneme. (Phonetically, then, the sentence

The man is here
3- °2-4-3: 3- °2-4//

has a prolongation of [n] but, phonemically, it has a pause, to be written
thus:

The man is here)
3- °2-4-3/ 3- °2-4


On a dictaphone record I once heard the following sentence: 


Listen for the syllables that have high pitch and heavy stress. 
93-2: 3-  92- -4-3: 3- 952: 53 [4- °2-2: °2-4// 


In order to rewrite the sentence phonemically, one needs only to substitute
single bars, or tentative pauses, for the colons indicating phonetic length.
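
The substitution can be sketched mechanically in Python. The sketch assumes that a transcription is held as a plain string in which a colon marks phonetic length, a single bar a tentative pause and a double bar a final pause; the function names `phonemicize` and `split_on_pauses` are hypothetical and are introduced only for this illustration.

```python
import re


def phonemicize(phonetic: str) -> str:
    """Replace each colon (phonetic length) with a single bar (tentative pause)."""
    return phonetic.replace(":", "/")


def split_on_pauses(transcription: str):
    """Split a numeric transcription into (contour_text, pause_type) pairs."""
    pause_names = {"/": "tentative", "//": "final", None: "none"}
    pieces = []
    # '//' is tried before '/' so that a final pause is not read as two
    # tentative pauses in succession.
    for match in re.finditer(r"([^/]+)(//|/)?", transcription):
        text = match.group(1).strip()
        if text:
            pieces.append((text, pause_names[match.group(2)]))
    return pieces


phonetic = "3- °2-4-3: 3- °2-4//"        # The man is here, with lengthened [n]
phonemic = phonemicize(phonetic)
print(phonemic)                          # 3- °2-4-3/ 3- °2-4//
print(split_on_pauses(phonemic))
```

The second function simply reads off which kind of pause closes each stretch of the transcription; it makes no attempt to interpret the pitch numbers themselves.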
The tentative and final pauses affect in different ways the material which
precedes them. The tentative pause tends

(1) to sustain the height of the final pitch of the contour. A °2-4 contour,
for example, before a tentative pause tends to end on one or more syllables
on pitch four without drifting downward; there may prove to be an
occasional slight drift upward, although never as much as is found in a rise
from significant level four to significant level three. In addition,

(2) the tentative pause often affects the quantity of the preceding contour
in various ways not as yet clearly defined. The syllable preceding a
tentative pause is often longer than usual, sustained on a level pitch. At other
times, it is the beginning point of the primary contour that carries length
and so gives the clue to the presence of the tentative pause. On the other
hand, the departure from the undefined norm may be in the opposite
direction, and yet give related results: a very short ending often indicates
that a tentative pause follows. The same person, repeating the same
sentence, may utilize different means for similar results in various
repetitions of the same sentence. In general, it may be that any departure from
the normal length of the elements of a primary contour contributes to the
recognition of a following pause as tentative, provided that the full height
of the pitch is sustained at the end of the contour.



The final pause modifies the preceding contour (or contours) by lowering
in some way the normal height of the end of the contour. If the contour
itself ends in pitch four, then preceding a final pause it will tend to fade
into silence while drifting downward; this is considerably different from
the pitch of the same contour which has a somewhat level, possibly
sustained, ending when it occurs in the middle of a sentence without pause, or
when it occurs before a tentative pause. If the contour is a falling-rising
one, the rise appears not to go quite so high as it does in the middle of a
sentence without pause, or before a tentative pause. This conditioned lower
height, of a °2-4-3 ending, is still markedly higher than the sustained end
of a °2-4 contour preceding tentative pause (however, a person who has
not had the difference called to his attention, by contrasting pairs, is likely
to confuse a °2-4/ with a °2-4-3// or even °2-4-3/; once it is pointed out,
however, the contrast is usually obvious to him).

Compare the following words when they are pronounced as part of a
series, and when they are the last of the group:


Apples,  pears,  oranges,  plums,  peaches.
°2- -4/  °2-4/   °2-  -4/  °2-4/   °2-  -4//

(I bought some) pears.
                °2-4//

I bought some plums.
3-            °2-4//


The difference between tentative and final pause is sometimes heard in 
an exaggerated form in fiery oratory. With some public speakers, one can 
know for some time in advance — say three short phrases or more — that 
the speaker is coming to the end of a paragraph or section of his oration, 
by the 'running down' modifications of his intonation contours as the
pitch is let down before such a final pause.

Of the two pauses, the tentative one tends to occur at all places where
the attitude of the speaker includes uncertainty, or non-finality. It is found,
then, in hesitation, and after almost all questions – there seem
to be a few exceptions when a person asks a question (with a falling
contour) without wanting an answer, or when he assumes the answer to
be certainly known. When a pause occurs after a rising contour, I have
found only the tentative type, both after questions, and statements, or
parts of statements (but occasionally I have found the final pause after a
falling-rising contour). A pause in the middle of a sentence is usually a
tentative one, but by no means always so. Notice the following illustrations
of tentative pauses:



I think I'll... 
85 525 3] 


ТД са 
Брај 3-/ 


Has he gone? 
4 °3-2/ 


Where has he gone? 
3- °2-4/ 


He bought it? 
3-94. 1-17 


I wanted to do it, but I couldn't. 

4- 2- -4/ 4- All | 

(Contrast: 7 wanted to do it, but fi couldn a f 
4- 92. -4--3] 4- *2- 4 | ER 


"Spanish is a "beautiful AP à 
ud udis des н ime of pause 
The final pause occurs where the speaker's WE SE Ge = 
is one of finality, and for this reason occurs urrence almost — but not 
Statements, The final pause is limited in КЫША ООА 
Quite – entirely to a position after а eon e АЕ ЕРЕ 
Occasionally it is found after °2-4-3, and fu 


tly in this position 
elsewhere. Since the tentative pause also ond A contrast. Gore 
‘it is principally here that the two pauses may 


Pare the following illustrations: 


t 
T'm going. (Implying, possibly, and that’s tha ) 


" 92-4] 
Tm going . . . (Implying, possibly, 
ipi Ө . d 
has either kind o: 
тле а VEO M ord d elaborate com; 
Within a space of a number of sentences. 


i its, see pp. 76-9.) f 
thythm unit; for the analysis of such un! па и се а М, 
Е requently pauses in the middle of senten 


its i h a way as to con- 
aller units in suc 5 
unit or separate sm illustration, a routine pause 
bled ет unity. In the ima) 3 in conjunction 
Separat i ; in the second illustration level intonation- set off the 
With unifying rhythm (pp. 79-81) and unifying lev 
nifying rhy: . У 


Unit three plus two: 


if you do not dissuade me) 




If 'Tom goes, I will 'too.
3- °2- -4-3/ °2-4- -4-3 °2-4//
'Two, times three 'plus two, is 'ten.

°2-4-3/4- "2 71- -4-3/ 3- °2-4// 

It is after primary contours that pauses may usually be found, as in the
previous illustrations. Hesitation forms, however, sometimes end without
a primary contour, and have merely an unfinished precontour. In this
event, a pause may occur at the end of an utterance in a place other than
at the end of a primary contour, as in the following illustration:


But he...
ez E 
Sometimes also, a tentative pause occurs in the middle of a primary 
contour. This tends to set off the second part, as some type of parenthesis, 
or form of address, as in the next illustrations. Notice that the second part
of the divided primary contour has no strong stress and that potential 
lexical stresses are sharply reduced in intensity (see also p. 80): 


No, Tom, (I do not.) 


°2-4-/ -4-3] 4- °3-4// 
No, he said. 
°2-4-/ AN 


Simple rhythm units (stress-timed and syllable-timed) 


Sentences are spoken with recurrent bursts of speed, with long
or short pauses or with intonation breaks between. A sentence or part of
a sentence spoken with a single rush of syllables uninterrupted by a pause
is a rhythm unit. The following utterances are usually spoken as single
rhythm units: the car; intonation; here it is; he said he would; a jumping
jack. The next group of utterances would tend to be broken into two
rhythm units each: I want to go but I can't; If he comes he'll buy it; every
day is Pepsodent day.

A rhythm unit which contains one, and only one, primary contour is a
simple rhythm unit. Notice the one strong stress and the one primary
contour in each of the following simple rhythm units:

the uni'versity
3-     °2- -4//

'Robert must do it.
°2-           -4//

The 'manager is the one who purchased it.
4-  °2-                               -4//



The timing of rhythm units produces a rhythmic succession which is an 
extremely important characteristic of English phonological structure. The 
units tend to follow one another in such a way that the lapse of time 
between the beginning of their prominent syllables is somewhat uniform. 
Notice the more or less equal lapses of time between the stresses in the 
sentence The 'teacher is 'interested in 'buying some 'books; compare the
timing of that sentence with the following one, and notice the similarity
in that respect despite the different number of syllables: 'Big 'battles are
'fought 'daily.

(Controlled strictly and mechanically in poetry – and possibly partially
so in some types of elegant prose – the recurrent stress timing is perhaps
even more important than the number of syllables in iambic or trochaic
groups, or the like. Evidence of this fact is seen in the esthetic satisfaction
obtained by English speakers from some lines of poetry – such as Break,
break, break – which do not have the full complement of syllables normally
to be found in the scansion of other lines of the same poems.)

The tendency toward uniform spacing of stresses in material which has
uneven numbers of syllables within its rhythm groups can be achieved
only by destroying any possibility of even time spacing of syllables. Since
the rhythm units have different numbers of syllables, but a similar time
value, the syllables of the longer ones are crushed together, and
pronounced very rapidly, in order to get them pronounced at all, within that
time limitation. This rhythmic crushing of syllables into short time limits
is partly responsible for many abbreviations – in which syllables may be
omitted entirely – and the obscuring of vowels; it implies, also, that English
syllables are of different lengths, with their length of utterance controlled
not only by the lexical phonetic characteristics of their sounds but also
by the accident of the number of syllables in the particular rhythmic unit
to which they happen to belong at that moment. Compare the similar
timing and stresses but variant number of syllables in the following pairs
of illustrations:


The 'man's 'here.
3- °2-4-3/ °2-4//

The 'manager's 'here.
3- °2-4-3/ °2-4//
If "Tom will Ч will. 
3- 92. 4.3/0. -4 || 
I'm do it Ч will. 


3- 92, 4 E 
- 54- -3/ °2- [| Д N 3 
2. For this basic principle of the timing of rhythm units and for similar illustrations
I am indebted to Daniel Jones (1956, 4886-20).



A single rhythm unit from such a sequence of units may be considered 
the regular or normal type. Because its length is largely dependent upon 
the presence of one strong stress, rather than upon the specific number of 
its syllables, it may conveniently be labelled a stress-timed rhythm unit (a
phonemic type in contrast to syllable-timed units to be mentioned below, 
with both of them on a different level of contrast from the simple versus 
complex rhythm types). 

Many non-English languages (Spanish, for instance) tend to use a
rhythm which is more closely related to the syllable than the regular
stress-timed type of English; in this case, it is the syllables, instead of the stresses,
which tend to come at more-or-less evenly recurrent intervals – so that,
as a result, phrases with extra syllables take proportionately more time, and
syllables or vowels are less likely to be shortened and modified.
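
The contrast between the two kinds of timing can be sketched with a little arithmetic in Python. The figures of 600 and 200 milliseconds are arbitrary assumptions chosen only to make the arithmetic visible, the syllable counts are a hand division of Big battles are fought daily into stretches beginning at each stress, and perfectly even spacing is an idealization of what the text calls a tendency.

```python
def stress_timed(syllable_counts, unit_ms=600.0):
    """Average syllable length (ms) when every unit takes about the same time."""
    # Longer units must crush their syllables into the same time span.
    return [unit_ms / n for n in syllable_counts]


def syllable_timed(syllable_counts, syllable_ms=200.0):
    """Unit length (ms) when every syllable takes about the same time."""
    # Extra syllables make the unit proportionately longer.
    return [syllable_ms * n for n in syllable_counts]


# 'Big | 'battles are | 'fought | 'daily  ->  1, 3, 1, 2 syllables
counts = [1, 3, 1, 2]
print(stress_timed(counts))    # [600.0, 200.0, 600.0, 300.0]
print(syllable_timed(counts))  # [200.0, 600.0, 200.0, 400.0]
```

Under the first assumption it is the syllables that vary in length; under the second it is the units.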

English also has a rhythmic type which depends to a considerable
extent upon the number of its syllables, rather than the presence of a
strong stress, for some of its characteristics of timing; in English, however,
the type is used only rarely. In these particular rhythm units each unstressed
syllable is likely to be sharp cut, with a measured beat on each one; this
recurrent syllable prominence, even though the stressed syllables may be
extra strong and extra long, gives a 'pattering' effect. The type may be
called a syllable-timed rhythm unit (in phonemic contrast to the
stress-timed type).

If the unstressed syllables are each made quite abrupt, the unit becomes
somewhat staccato. If the unstressed syllables are more or less equally
timed, and somewhat prominent, but glided or smoothed together, the
general impression is that of a spoken chant. Consider the following
sentence, in this latter style:

Susie is a tattle tale.
°2-2- °3- -1- °2--2- °3-3// (Chanted with syllable-timed rhythm)

Words in a very close grammatical association are likely to belong to
the same rhythm unit. Notice the grammatical relationship between the
words in the following illustration:

the boy
3- °2-4//

He's gone.
3- °2-4//

Come in.
3- °2-

It's a big one.
3-

Hit him.
°2- -4//

When is it?
3- °2- -4/
hythm 
i i tress tend to join that r 
i no innate lexical s 5 ; 

vcn cce pue them with which they Ше каидан са 

Oup prece HAN 3 Kos Я 
Tie en related, as the following illustrations dem 


I'm going to, tomorrow. 
3- 92. A 3524 [| 
Не gave it to the man. 
3-72- -3/ 3- °2-4// 
Whom did you tell it to yesterday? 
#2-3/ 3- °2- -3/°2- All xd M 
x (pp. 76-9) rhythm uni 
inni i r complex (pp. у n t (but 
n fes mem i а pn always coincides with the beginning 
Ota weak rhy ‚р. 
Of a word, as in | 
the boy, 
3- °2-4/ and so on. 


it likewise tends to 
` lex rhythm uni 
inni simple or comp! ther the total contour 
Tus prine оа SC of a total MN ШШК СООК ds 
Oincide ue t dins or begins directly with а p 

Bins with a prei 


the boy ` and Don't. 
Зе ` "Za 


i ds to end coinciden- 
he endi f a simple or complex rhythm unit ten 
The en Ing o 


lace where some 
i also, at the р n 
wily with some words ne E Potentially, then, the begin- 


hythm units and two total contours 
у 


i r 
illustrations (in the second, std unit and one total contour of the first 
Occupy the place of the one г! 


illustration): 


Jim has gone! 
WEI 




/ Jim has gone! 


°2-4-3] 4- °1-4// 


In traditional English orthography, a punctuation mark usually, but 
not always, represents (1) a pause, and, therefore, (2) the end of a rhythm 
unit; in addition, it sometimes gives (3) a partial indication of the attitude 
of the speaker — a fact which, in turn, conditions the stress placement, or 
degree of stress, or intonation placement, or specific intonation, or even the 
quality of the voice, or some combination of these. Punctuation marks are 
often supplemented by italics, or capital letters, and so on, to make the 
stress and intonation type and placement more specific. 


Complex rhythm units (including syllables in double function;
intonation breaks; parataxis; unification rhythm)


Frequently – especially in fast speech – two or more simple rhythm units,
each with one primary contour, may be coalesced into one large rhythm
unit. Such a combination comprises a complex rhythm unit, and contains
at least two primary contours with no pause between them; the loss of a
pause between two simple rhythm units changes the combination into a
single complex one. The complex unit, like the simple one, has just one
rush of syllables without a pause, but contains two strong stresses instead
of one. In the following pair of sentences, notice that the first contains two
simple rhythm units, but that the second sentence has only one unit, which
is complex.


The 'children of the community are 'interested.
3- °2- -4- -3/ 4- °2- -4//


The 'children of the community are 'interested.
3- °2- -4- -3 4- °2- -4//
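
The distinction can be sketched as a simple count in Python. The sketch assumes that a transcription is a plain string in which bars mark pauses and each degree sign opens one primary contour; the function name `rhythm_units` and the two short test strings are illustrative assumptions, not material from the recordings.

```python
import re


def rhythm_units(transcription: str):
    """Split on pauses and label each unit by its number of primary contours."""
    units = []
    for chunk in re.split(r"/+", transcription):
        chunk = chunk.strip()
        if not chunk:
            continue
        primaries = chunk.count("°")  # one degree sign per primary contour
        if primaries == 0:
            kind = "no primary contour"
        elif primaries == 1:
            kind = "simple"
        else:
            kind = "complex"
        units.append((chunk, kind))
    return units


# With a pause between the two total contours: two simple rhythm units.
print(rhythm_units("3- °2-3/ 3- °2-4//"))
# With the pause lost: one complex rhythm unit.
print(rhythm_units("3- °2-3 3- °2-4//"))
```

The count ignores intonation breaks within a unit; it separates units only at written pauses, which is the criterion stated in the paragraph above.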


In the middle of a complex rhythm unit, a borderline syllable may serve
in a double function. These syllables are the precontours of following
primary contours, but at the same time give the impression of changing
previous level primary contours into falls or rises. Such a syllable may be
recognized in the symbolism by the fact that a hyphen immediately follows
it, to join it to the succeeding primary contour, but a second hyphen
immediately precedes the syllable to join it to the preceding primary
contour which is level and short. In the following set of illustrations, the
first sample contains two simple rhythm units and two total contours
with a pause between them. The second has one complex rhythm unit but
with the total contours separated. The third has one complex rhythm
unit with a syllable in double function between.




a 'book of 'stories
3- °2-3/ 3- °2-4//


a 'book of 'stories
3- °2-3 3- °2-4//


a 'book of 'stories
3- °2- -3- °2-4//


(A fourth type seems to be much less frequent; it has no down glide on
the first primary contour which, nevertheless, is short but followed by a
pause:


a 'book of 'stories.)
32.92. pasce 

Although in the preceding section the relationship of the beginning and
end of a (simple or) complex rhythm unit to the beginning and end of
words and total contours has already been discussed, a further point
should be emphasized. Since two (or more) primary contours occur
in a complex rhythm unit, the first of these contours ends within the
rhythm unit; this implies that the contour border does not
coincide with the rhythm border in such cases. [...]
ending constitutes an intonation break [...]
which is still capable of influencing the rhythm.

[...] an intonation break may [...] all, but the potential after a primary
contour [...]. A slightly slower rate of utterance will often break a complex
rhythm unit into simple units, even without a marked change in the
speaker's attitude or attention, simply by [...] the
first primary contour. In general, pauses can be [...] the sentence only
when the speaker changes his attitude, speed, or emphasis, quite sharply.

[...] contribute to the continued [...] and unity of a primary or total
contour even [...] shadowed by the rhythmic unity of a larger [...]. The
individual contour unity is further maintained within [...] unit by the
tendency of the unstressed syllables (or [...]) [...] closely to the total
contour of which they are immediately a part [...] the other total
contours in the same complex rhythm unit. The unstressed syllables of
the precontour are [...] than those of the end of the preceding primary
contour,³ and this difference in speed


3. I am indebted to instrumental studies of Classe (1939, pp. 116; 127), for
measurements which called this to my attention.




sets them apart. This coherence of the parts of a total contour is symbolized
by the hyphens which connect their parts. Hyphens are not used
to connect two total contours, except in the case of syllables in double
function, which are precisely the ones which at times tend to eliminate a
barrier between contours.

In the first of the following illustrations, for example, notice that the
hyphen arrangement signifies an intonation break of organization between
go and but even when no pause occurs there. In the second illustration
notice the optional pause. In the third sentence the intonation at the
end of the first primary contour is modified to adapt itself more easily to
slow deliberate speed within the same essential pattern of attention. In
the fourth sentence, the speaker's attention switches from the desire to
himself and to the action; this change of attention is accompanied by an
added pause, an added total contour, an added rhythm unit, changed
place of stress and modified intonation types. The fifth sentence retains
the intonation contours of the fourth, but combines them into a single,
rapid (pauseless), complex rhythm unit.


I 'wanted to go but 'couldn't.
3- °2- -4 4- °2- -4//


I 'wanted to go but 'couldn't.
3- °2- -4/ 4- °2- -4//


I 'wanted to go – but 'couldn't.
3- °2- -4-3/ 4- °2- -4//


'I wanted to 'go but 'couldn't.
°2-4/ 4- °2-4/ 4- °2- -4//


'I wanted to 'go but 'couldn't.
°2-4 4- °2-4 4- °2- -4//


Rhythm-unit barriers, whether between simple or complex units, may
have grammatical significance. Two items which are not related by the
grammatical subordination of one of them to the other, or by a close linking
together as parts of a favored construction (such as the two main elements
of a clause, the subject and predicate), may nevertheless be considered
parts of a single grammatical unit if they are both within the same simple
or complex rhythm unit. This relationship constitutes a type of parataxis.
In the following set of illustrations, the first shows two rhythm units, with
two items grammatically separate; the second illustration shows the same
two items grammatically united because of their inclusion within a single
complex rhythm unit:




It's ten o'clock. I've got to go home.
3- °2- -3- °2-4// 4- °2-4//


It's ten o'clock; I've got to go home.
3- °2- -3- °2-4 4- °2-4//


It should be noted that the preceding illustrations each have the same 
intonation contours, but are distinguished by their rhythmic organization. 

A complex rhythm unit may become very long and involved, and the
paratactic relationships intricate, if the speaker happens to be of the type
(fortunately very rare, for the effect may be unpleasant) who gives the
general appearance of introducing no pauses for emphasis or grammatical
separation unless or until he runs out of breath. The following passage
illustrates this abnormal type; observe the rhythmic organization; in
reading it no pauses or hesitation should be introduced:


He 'said he was 'going but he 'didn't do 'anything to get 'under 'way
3- 92. A 9244. A "2 -4-?2- -4 Из 92-4 72- 


and he ‘came to the ‘door He ‘stood there like a ‘dunce He just ‘watched 
E 92. Al 9). 4. %2- 4 4- 22. -4- °2-4 


'other people 'pack their things He 'didn't help at 'all.
°2- -4 °2- A 92-  -4- 524 92-1 All D 
A certain type of complex rhythm unit has an implication of unification.
It seems usually to be characterized (1) by the presence of two (or more)
strong stresses, (2) by a relatively rapid rush of syllables joining them
together, (3) and by the absence of pause. Further, in relation to intonation
there appears to be a minimum of contour separation in the middle of the
complex, and the first contour of the series is usually level. In combination
with the intonation appropriate to the context it is often utilized to show
grammatical unity, or unity within mathematical parenthesis, or the unity of
a phrase label, and the like. Compare the following illustrations for the types:


He's a big man. (Contrast: a big man)
92 °2-4// °2-3/ °2-4// 


Three minus two, times five, ...
"2 *2-2 °24-3/ 4- °2-443/ 


The Chamber of Commerce, is there. 
92.2 2- °2--4--3/ 4 °2-4// 


Weak and curtailed rhythm units


When a tentative pause interrupts a primary contour, the second part of
the contour then is left between pauses, constituting a separate rhythm




unit. However, this second part of the contour either has no innate lexical 
stress, or its lexical stresses have been partially reduced (conditioned by 
their position in the total contour). The result, then, is the formation of a 
rhythm unit with no strong stress, that is, a weak rhythm unit. Two of the 
principal types of these consist of (1) an indication of the speaker, or (2) 
an indication of the person addressed. These types may be seen in the 
following illustrations: 


'This is the one, the teacher said. 
?2- -4-/ KI 


'Yes, George, it's 'time to 'go.
°2-4-/ -4-3 | 4- °2- -3- °2-4// 


The first part of the primary contour also constitutes a rhythm unit
because it likewise occurs between pauses. This part, however, contains
the prominent stress of the primary contour. It differs from a normal
simple rhythm unit precisely because it does not include the end of the
primary contour; for this reason, it may conveniently be called a curtailed
rhythm unit. In the preceding set of illustrations, 'Yes and 'This is the one
constitute curtailed rhythm units.
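
The four unit types distinguished so far (simple, complex, weak, curtailed) can be summarized as a small decision procedure. The following is only an illustrative sketch: the `Stretch` record and its two fields are hypothetical conveniences, not part of Pike's notation, and the sketch assumes each pause-bounded stretch has already been analysed for its strong stresses and for whether its last primary contour is completed inside it.

```python
from dataclasses import dataclass


@dataclass
class Stretch:
    """A pause-bounded stretch of speech (hypothetical representation)."""
    strong_stresses: int         # number of primary-contour stresses it contains
    ends_primary_contour: bool   # True if its last primary contour is completed here


def classify(stretch: Stretch) -> str:
    """Label a stretch with the rhythm-unit type described in the text."""
    if stretch.strong_stresses == 0:
        # No strong stress at all: a weak rhythm unit.
        return "weak"
    if not stretch.ends_primary_contour:
        # Has the prominent stress but lacks the end of the contour.
        return "curtailed"
    if stretch.strong_stresses == 1:
        # One complete primary contour between pauses.
        return "simple"
    # Two or more primary contours with no pause between them.
    return "complex"


# 'This is the one, | the teacher said.
print(classify(Stretch(1, False)), classify(Stretch(0, True)))
```

On this reading, 'This is the one comes out as curtailed and the teacher said as weak, matching the illustrations above.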

When a sentence, because of the speaker's hesitation, is interrupted
before the beginning of a primary contour (that is, before any strong
stress has appeared) the unstressed precontour constitutes a type of weak
rhythm unit. Compare the following samples:


Why ...
44 

In the ...

Ээ» vf 

If he had only ... 

4- / 


Occasionally, also, the identification of a speaker is given in a short
rapid unstressed precontour, or part of a precontour, with a pause between
it and the quotation which follows. The first part of such an utterance
produces a weak rhythm unit, as in the following sample:


I said – I will 'not go.


9-9 dd. - 72:434] 
Curtailed and weak rhythm units may occasionally be found under very
special conditions of attention, contrast or emphasis, as in the following
illustration in which a pause occurs in the middle of the word:




I said – Re- write.
Т 


Usually, however, no pause is found in this kind of emphasis within a word;
rather the expression tends to be in a single complex or simple rhythm
unit, as in

I said — Re- write. 

3- °2-4--4// 


Summary of contrasts between pause, rhythm and intonation 


After the preliminary statement of the general characteristics of intonation
contours, and the discussion of pause and rhythm, it is useful to
summarize the reasons why no one of these can be equated phonemically
with either of the others:
An intonation contour as such cannot establish the nature and borders
of a rhythm unit since
1. some single contours are divided into two rhythm units (that is, into
a curtailed one and a weak one, as in
Tom has gone ).
°2-4-/ -4- -4.3/ | 
2. some single (but complex) rhythm units contain two or more intonation
contours (as in
The doctor is here ).

3- °2- -3 4- °2-4// 

Pauses cannot be equated with the borders of intonation contours,
since pauses may occur
1. at the borders of the contours (for example, between simple rhythm
units, as in
He's coming today ).
3- °2-3/ 4- °2-4//
2. in the middle of contours (before a weak rhythm unit, as in
Here, said I ).
3. may be absent from a junction of two contours (in complex rhythm
units, as in
Nobody came).
743 •2.4/]


Although rhythm unit borders coincide with pauses, neither one causes
the other, since a unit of speech identical in rhythmic timing may end in
either of two pause types (tentative and final, as in


'I'll go ... and 'I'll go ).
°2- -4/ °2- -4// 


In addition, a unit of speech divided by a certain pause type, and ended
by that same pause type, may nevertheless be pronounced with two or
more different rhythmic patterns. Note, for example, that a simple rhythm
unit may be followed by a simple rhythm unit, or a curtailed rhythm unit
followed by a weak rhythm unit, as in


Help! Catherine! versus Help, Catherine!
Е 3 °1-4-/-4- Ai 


Note that a further type of rhythmic contrast within controlled pauses
is between simple and complex rhythm units, as in

Tom has gone versus Tom has gone .
БОЗИИ 24 22-3 4- °2-4// 
A third type of contrasting rhythm, within a controlled-pause context,
exists between stress-timed and syllable-timed units, as in

Tommie isn't here (with syllable-timed rhythm) versus
"2- -2- *3-1- °2-2-°3-3/


Tommie isn't here. (with regular stress-timed rhythm).
°2--4 °2-4 AN


References


CLASSE, A. (1939), The Rhythm of English Prose, Oxford University Press.
JONES, D. (1956), Outline of English Phonetics, Dutton.
PIKE, K. L. (1944), ‘Analysis of Mixteco text’, Int. J. Amer. Ling., vol. 10,
pp. 113-38.
PIKE, K. L. (1948), Tone Languages, University of Michigan Press.


4 George L. Trager 


The Intonation System of American English 


George L. Trager, ‘The intonation system of American English’, from In Honour
of Daniel Jones: Papers Contributed on the Occasion of his Eightieth Birthday,
12 September 1961, edited by David Abercrombie, D. B. Fry, P. A. D. MacCarthy,
N. C. Scott and J. L. M. Trim, Longman 1964, pp. 266-70.


In the system of linguistic analysis practised by the present author, the
intonation system of English, whether American or any other variety,
functions as part of the syntax, delimiting stretches of utterance (clauses)
that are examined in order to determine their structure, that is, their
syntactic structure. But the intonation patterns that make up the system
are themselves composed of certain kinds of phonological elements. And
it therefore seems appropriate to describe the various aspects of these
phonological elements as the author and many of his colleagues use and
interpret them.

It is the intention to make this description succinct, and no discussion
of other interpretations and of criticisms is given, nor is there much
bibliography. This is a presentation of a position, and a statement of conclusions.
It is believed that it will be useful, since no similar statement that is up to
date has yet been published anywhere.

1. In An Outline of English Structure, hereafter referred to as OES (Trager
and Smith, 1951), we discussed pitch and terminal junctures in sections
1.71 and 1.72 (pp. 41-8), and in the discussion of syntax (sections 4.0 to
4.5, pp. 67-80) there was some treatment of intonation patterns. Shortly
after the publication of OES, the group of linguists working on the
preparation of English-teaching materials at Cornell University in 1952 and 1953
(and at various other places for some years following), and including such
excellent observers as Welmers and Hockett, noted certain omissions and
difficulties in our presentation. Smith and I immediately recognized the
correctness of the criticisms, and made the needed restatements; these
were not published as such, but have been mentioned or alluded to in
various publications by Smith and others. A couple of years later Sledd
pointed out, in personal communications and oral presentations, that still
further details were unaccounted for. Again Smith and I were able to see
how these fitted into the total picture, and to make [...].
Since about 1957 Smith especially has [...] of
intonation patterns in English as markers of the boundaries of [...]
units, and he has in various stages of completion a series of [...]



about English syntax and semology, based strictly on phonologically
bounded units. In this connection it is also pertinent to note that it has
become possible to separate out paralinguistic pitch phenomena from
those of language proper (Trager, 1958). It is our hope eventually to
publish this material either in a series of articles or in the form of a total
revision and expansion of OES.


2. In OES we accepted and started from the basic analysis of English
pitch made by Wells (1945) and extensively applied by Pike (1945).
There are four pitch phonemes (in most American usage a distinctive
phonological unit is called a phoneme whether it is a vowel or a consonant
or a stress or a pitch or something else). We call these ‘low’ /1/, ‘middle’
/2/, ‘high’ /3/, ‘extra high’ /4/. They are found to have allophones that
vary in height in terms of the stress (primary /´/, secondary /^/, tertiary
/`/, weak /˘/ [or unmarked]) of the syllabic that they accompany. A detailed
examination of such allophones is given in OES, 1.71, pp. 42-4. Other
allophones, involving contour or direction (sustained, rising, falling) are
found at terminal points (OES, 1.72, pp. 44-9) and involve the amount
of segmental material covered by the pitch.
From the material presented in OES, the following statement of
intonation patterns can be constructed (examples will be given in 3
below): American English intonation patterns consist typically of three
pitches and a terminal contour. The initial pitch of the three is most often
/2/ but may be any of the others. The central pitch accompanies the
primary stress of a phrase or clause, is most often /3/ in all kinds of material
(statements, questions, or the like), but is frequently /4/ when there is
what is usually called emphasis, and may often be /2/ or /1/. The final pitch
is most often /1/ at the ends of statements, /2/ at the ends of clauses that
do not end sentences, /3/ at the ends of certain kinds of questions, but may
be any one of the four. The final pitch is modified by the terminal contour
being sustained /|/, rising /||/, or falling /#/; sustained occurs most often in
clauses that do not end sentences, falling in statements and
interrogative-word questions, rising in other questions and in many non-final clauses.
When a clause begins with the primary-stressed syllabic, there are only
two pitches, the central and the final, the initial being absent.
The modification necessitated by the observations of Welmers and
Hockett involved the possibility of a fourth pitch in a clause, appearing
after the initial and before the central. It has become clear that this pitch,
when it occurs, always accompanies the secondary-stressed syllabic nearest
to the primary, or, if there is no secondary-stressed syllabic, it falls on the
tertiary nearest the primary. Any of the four pitches can appear in this
position, /?/ being most frequent.




Sledd's observations indicated that there are also clauses containing 
four pitches in which there is a pitch after the central one and before the 
final one. Again, it seems to fall on secondary-stressed syllabics, or on 
tertiary-stressed ones if there is no secondary, but there are instances 
where it accompanies weak syllables when there are no stronger ones 
present.

Putting all these statements together, we now say: an intonation-pattern
contains five pitch positions, which we designate as a, b, c, d, e. No
occurrence of all five is known, and we believe it is not possible; the
occurring forms are ace, abce, acde, and, when the clause begins with a
primary, ce and cde. The primary stress always accompanies c. If a clause
begins with a secondary, immediately followed by a primary, it may be
asked whether the pattern is ace or bce; we know of no way to answer this
question as yet, but believe that bce may well be the answer; of course, if
the clause ends as ... cde, then the pattern can only be, we hold, acde,
since we do not believe that b and d can occur in the same clause. The
clause ends in a terminal contour (T). The intonation-patterns are then of
these forms:


aceT 
abceT 
acdeT 
bceT 
ceT 
cdeT

Any of the positions may be filled by any one of the four pitches, and T
is any of the three contours. It is not known whether all the possible
combinations occur; but a good many of them have actually been observed in
material spoken naturally and recorded on tape.
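
The size of the space of "possible combinations" mentioned here can be worked out mechanically. The following is a minimal sketch under stated assumptions: pitches are written as the integers 1 to 4, the three terminal contours as the strings `"|"`, `"||"` and `"#"`, and the names `FORMS`, `patterns` and `count_patterns` are illustrative only. It counts logically possible fillings of the six listed forms; it says nothing about which of them are attested.

```python
from itertools import product

# The six pattern forms listed in the text (T, the terminal, is added separately).
FORMS = ["ace", "abce", "acde", "bce", "ce", "cde"]
PITCHES = [1, 2, 3, 4]          # low, middle, high, extra high
TERMINALS = ["|", "||", "#"]    # sustained, rising, falling


def is_valid_form(positions: str) -> bool:
    """True if the sequence of filled positions is one of the listed forms."""
    return positions in FORMS


def patterns(form: str):
    """Yield every (pitch sequence, terminal) filling of one form."""
    for pitches in product(PITCHES, repeat=len(form)):
        for terminal in TERMINALS:
            yield pitches, terminal


def count_patterns() -> int:
    """Total number of logically possible patterns over all six forms."""
    return sum(len(PITCHES) ** len(form) * len(TERMINALS) for form in FORMS)


print(is_valid_form("abcde"))   # b and d are held not to co-occur
print(count_patterns())
```

Under these assumptions the three-position forms each allow 4 × 4 × 4 × 3 = 192 fillings, the four-position forms 768 each, and ce 48, for 2160 in all, which gives a sense of why the text leaves open whether every combination occurs.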


In OES the examples given implied that rather long stretches of material
could [...] with one intonation [...].
[...] possible [...] of an oratorical or other literary or
technical nature, but we believe that in most ordinary speech the clauses
are rather short, and that most long sentences contain many pre-final
clauses ending most usually in sustained /|/.


3. A few examples may now be given. These are as ordinarily spoken by
the present author.


асеТ — 2pm going *home* # 
рт pe *hóme! # (definitely not олеге else) 
2]m góing ?hóme? [büt I'll bè 3páck! # 
2Аге you góing 3hóme? || j 




грт going *héme nów? | . . . (doubtful, or reticent) | 
4 ?Whére аге уби ?góing?  E?lízabeth? || k 
2Whére аге уби ?góing? |: E!lízabeth! || (less polite) 
?Whó(m) аге уби calling? /2E?lízabeth? | 
abceT It’s in ?chápter ?óne! # 
2Hé’s а ?góod ?bóy! # 
?Hé's а ?góod ?bóy?|?but . . . 
?jt's à 21дпв ?stóry? |"ánd it'll ‘bore уби: # 
acdeT i'm ?góing !'hóme! 7 
?|'m góing ?hóme ?nów! # 
2105 ?wón?derful! # 
beeT ` ?Chápter ?Óne! # 
сет Always! # 
*Néver! # 


elef — Bat Зубиг lünch! # 


A succession of short clauses: 

„ (2м [РЇ ?thínk? |?it'd bà 221. *right?|?t6 Зрб !nów! # 

4. This systematization is based on American English. We have heard
enough other varieties, however, and have examined enough of the
reported intonation data for them, to be convinced that the system set
forth here holds for the whole of the English language. The seeming great
differences in the way different kinds of English sound in respect to
intonation are due, we believe, to different distributions and occurrences
of the pitches and terminals, within the same system. Thus American
2Wh0’s ?thére! # 

and Southern British 

*Whó's ?thére! # 

are different exemplifications of aceT.


References


PIKE, K. L. (1945), The Intonation of American English, University of Michigan
Press.
TRAGER, G. L. (1958), ‘Paralanguage: a first approximation’, Studies in Linguistics,
vol. 12, pp. 1-12.
TRAGER, G. L., and SMITH, H. L. (1951), Studies in Linguistics, occasional paper
no. 3, University of Oklahoma Press.
WELLS, R. S. (1945), ‘The pitch phonemes of English’, Language, vol. 21, pp. 21-39.




5 Robert P. Stockwell 


The Role of Intonation: Reconsiderations and other Considerations 


A paper written specially for this volume.


In my article on this subject (Stockwell 1960) (written just weeks after my
first exposure to transformational theory at the 1958 conference on
English grammar at Austin, Texas),¹ two claims were made about intonation
in grammar that I very soon came to believe were wrong:


1. (i) That the number of surface phonological phrases tends to correspond
one-for-one to the number of deep sentences.
(ii) That choice among alternative intonation contours is on a par with
choice among alternative category realizations within the base component:
i.e. that one ‘chooses’ contours as one ‘chooses’ lexical elements.

There are several kinds of correlation between deep structure and intonation,
but nothing as simple as (1.i). On the other hand, neither is intonation
a simple function of surface structure, as was assumed by Chomsky and
Halle (1968). A good deal of work of recent vintage, in particular Bresnan
(1971; 1972), Downing (1970), Pope (1971), Lakoff (1972), Berman and
Szamosi (1972) and Bolinger (1972), has borne on the question of predicting
the location and form of intonation contours from levels of deep or
shallow structure (and to some extent surface). It is possible that the only
aspect of intonation that is predictable from surface structure alone is the
range of ‘optional phrasing’ possibilities (Bierwisch, 1966; Downing,
1970). The other matter on which I believe I was wrong, the choice of
meaning differences between contours (i.e. where the difference is not a
function of the location of the center of the contour, or of the presence or
absence of a contour, but in the actual form of the contour itself), has not
received much subsequent clarification.

1. At which Noam Chomsky presented the ‘Transformational Approach to Syntax’
that was [...] in mimeographed form, subsequently published in Hill
(1962) and reprinted in Fodor and Katz (1964).


There were also claims in that early work which I see no reason to
retract, though some of them need considerable elaboration:


2. (i) That there is such a thing as a ‘neutral’ or ‘normal’ or ‘colorless’




intonation contour for any sentence, serving as a baseline against which
all other possible contours are contrastable, and thereby meaningful.²
(ii) That it is an intrinsic property of certain transformational rules that
they assign to their output an intonation contour (i.e. that not all contours
are predictable from inspection of phrase-markers at the surface or any
deeper level: that some contours are consequent upon the derivation
itself).³
(iii) That what is relocated to form ‘contrastive stress’ is the center of the
intonation contour: that the notion ‘center of the contour’ is a distinct
notion within a correct theory of intonation. It should not be collapsed
with the notion ‘stress’ that relates to levels of prominence lower than the
major one of each phonological phrase.
(iv) That Prepositions and Personal Pronouns (and, I should have added,
several other ‘grammatical’ or ‘functional’ classes, like Articles, some

2. It is true that coreferential noun phrases in a sentence require destressing of the
second NP, in general, even if the second NP is not an ordinary pronoun: I voted for
Eisenhower even though I didn't much CARE for the general. The phenomenon of
anaphora is closely linked with those of stress reduction and contour-center location.
In all such cases, I believe it can still be maintained that there is a ‘normal’
(non-emphatic) reading which includes, as part of the specification of ‘normal’, this kind of
anaphoric destressing, and that further juggling of the contour center produces a
reading which must single out for emphasis, by virtue of its contrast with the normal
reading, some otherwise unpredictable item for stress/pitch highlighting (= emphasis).
3. By this I mean, for example, that the rules which perform such operations as
inversion of subject and auxiliary to form questions must themselves ASSIGN the
appropriate rising contour to their output. The rising contour cannot depend on deep
structure, since the underlying form of yes/no questions and of information questions
is identical: the difference depends only on where the WH- morpheme is attached (to
an NP, either within an adverb such that at what time → when, at what place → where,
etc., or to the conjunction either/or such that WH-either → whether in indirect
questions, and is zeroed out of direct questions). Nor can the rising contour depend on
surface inversion of subject and auxiliary, since the inversion also occurs with initial
negatives: Never have I met such a fool. Besides the rising contour of certain
interrogatives, it appears to me that part of the function of any rules that deal with coreference
is, at the very least, to mark repeated coreferential items as [-Contour Center], or some
such specification, so that the basic rule of normal contour-center placement, roughly
‘Place the main stress (= contour center) on the last stressable item to the right’, can
apply correctly in the presence of non-pronominal anaphoric elements. There is at
least one type of example, observed first (so far as I know) by George Lakoff and called
to my attention in this connection by Mona Lindau, where the usual destressing of
pronouns and other anaphoric elements is reversed: Bill kicked JOHN, and then HE
kicked HIM. This is necessary because of the use of contrastive stress twice in the same
sentence: HE referring to John in contrast to Bill, and HIM referring to Bill in contrast
to John. Note that these can BE contrastive only by virtue of the ‘normal’ reading (for
the same string in another context) He KICKED him, where he is Bill and him is John.
The notion ‘contrastive stress’ entails, as I see it, a prior notion of ‘non-contrastive’,
‘normal’ or ‘neutral’ contours.




Auxiliaries, Modals, Conjunctions, certain classes of Particles and Adverbs;
in general, all classes which can enter into satellite ‘clitic’ relationships
with Nouns, Verbs and Adjectives [though the matter is not simple: see
Kingdon (1958, pp. 170-207)]) are obligatorily destressed (or never receive
stress) and do not ‘count’, as it were, in computation of the center of the
NEUTRAL contour.
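
Claim (iv), taken together with the rough rule quoted in note 3 (‘Place the main stress (= contour center) on the last stressable item to the right’), can be pictured as a right-to-left scan. The sketch below is only an illustration under strong simplifying assumptions: each word arrives already tagged with a word class and an anaphoric flag, the class labels in `DESTRESSED` are an ad hoc stand-in for the ‘grammatical’ classes named in the text, and contrastive relocation of the center is not modelled at all.

```python
# Hypothetical class labels standing in for the obligatorily destressed classes.
DESTRESSED = {
    "preposition", "pronoun", "article", "auxiliary",
    "modal", "conjunction", "particle",
}


def neutral_center(words):
    """Return the index of the neutral contour center, or None.

    `words` is a list of (form, word_class, is_anaphoric) tuples. The scan
    runs from the right and skips destressed classes and items marked as
    anaphoric, i.e. [-Contour Center] in the sense of note 3.
    """
    for i in range(len(words) - 1, -1, -1):
        _form, word_class, is_anaphoric = words[i]
        if word_class in DESTRESSED or is_anaphoric:
            continue
        return i
    return None


sentence = [("He", "pronoun", False), ("kicked", "verb", False), ("him", "pronoun", False)]
print(sentence[neutral_center(sentence)][0])
```

For He kicked him the scan skips him and lands on kicked, the ‘normal’ reading He KICKED him cited in note 3; an anaphoric full noun phrase such as the general in note 2 is skipped in the same way.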

I would like to consider these various claims in relation to subsequent
research to see to what degree a coherent theory of intonation has been
achieved and to what extent there remain areas largely unilluminated. To
discuss them I will distinguish four kinds of questions:


A. The REPRESENTATION question: What is the most economical 
and realistic system of linguistic representation of the intonational facts 
(i.e. the total set of linguistically negotiable perturbations of pitch and 
rhythm)? 

B. The NUCLEAR-STRESS question: On what basis, and to what
extent, is the location of the contour CENTER to be predicted?⁴ (A
Corollary of this question concerns the prediction of which items can be 
de-stressed. Destressing, and contour center marking, are two sides of the 
same coin.) 

C. The BOUNDARY question: On what basis, and to what extent, are 
the intonational ‘pauses’ to be predicted? (By ‘pause’ I mean the 


4. By contour center I mean the point at which the pitch contour sharply changes:
the pitch skips up, or down. The center need not be the highest pitch level, nor the
lowest, but only the point of (relatively) abrupt transition.


[Figure: pitch contours for ‘He did very nicely.’ in four readings: simple
assertion; assertion with minor reservations; echo question; assertion with
major reservations.]




boundaries between intonation contours, which do not correspond with
silence, or absence of phonation, or ‘breath groups’.) ‘Pauses’, in the
sense here intended, are uniform perceptual realities, but they may not
have uniform physical correlates either acoustically or articulatorily:
perhaps timing.


D. The MEANING question: What is the range of meaning that can be
differentiated directly or indirectly by intonational facts? (I shall not deal
with this question here. I include within it and the previous one the matter
of segregating out the endless variety of ‘tone of voice’ information that is
not negotiated on a strictly linguistic basis. See Stockwell, Bowen, and
Silva-Fuenzalida, 1956.)


A The representation question


The greatest part of the literature on intonation prior to Pike (1945), as
well as most of the literature produced in Europe to this date, has focused
on the meaning question, and of course much of Pike's own work provided
lively and insightful analysis of subtle contrasts in meaning that he
believed intonation could differentiate. With Pike (1945), Trager and
Smith (1951), and Chomsky and Halle (1968), the representation question
received enormously more attention than it had before. This question
naturally entails decisions about the relative weights of various acoustic
parameters, in order to minimize the set of prosodic features required and
arrive at an optimal phonemic representation. The most recent exemplars
of this debate are Vanderslice and Ladefoged (1971). Lieberman (1967)
was crucially devoted to this problem (as was some of Lieberman's earlier
research, including his brilliant demolition (Lieberman, 1965) of the
Trager-Smith analysis with respect to its claims of syntactic independence).
The representation question can be viewed formally or informally.

Informally (that is, how can one unambiguously and efficiently write
sentences down so that they can be read back as intended) the question
is not important. I am content to represent intonation with squiggly lines
or with lines of type following the contour as Bolinger does, or in the
manner of Kingdon (1958). But formally (that is, trying to determine the
set of features that most persuasively account for the phonetic capacities of
man, in respect to the role of intonation in natural languages) the question
remains one of some interest. Throughout the 1950s when this question
was of burning interest, the most brilliant contributions were made by
Bolinger (esp. 1958a, 1958b, Bolinger and Gerstman, 1957), who demonstrated
beyond all doubt that what everyone really meant by ‘main stress’
or ‘primary stress’ or ‘heaviest accent’ was pitch obtrusion (the famous
accents A, B and C); and that what the Trager-Smith ‘superfixes’ came


90 Theory 


SN 2 bn AA ry 


down to (the famous lighthouse keeper examples) could be unambiguously 
rendered only by appropriate devices of timing (‘disjuncture’). 

For reasons which remain mysterious to me, Chomsky and Halle persisted
through their major opus in providing ingenious rules that are
capable of assigning levels of stress far more finely differentiated than the
four that Trager and Smith claimed. The impossibility of finding consistent
perceptual correlates for putative contrasts between two (non-nuclear
stress) versions of the blackboard eraser, lighthouse keeper, British history
teacher types of examples has given Vanderslice (1970) rich ammunition
for his amusing if sometimes pompous annihilation of the superfixes and
their generative-transformational reflexes. Chomsky and Halle took the
Trager-Smith data as given, and they undertook originally to show how
these complex stress patterns were syntax-dependent.

The Trager-Smith four stresses still remain superficially intact in the
latest foray (January 1971) of the Vanderslice-Ladefoged campaign.
Primary is identified by them as the simultaneous features [+heavy],
[+accent], [+intonation] – i.e. it is that accentable syllable which falls at
the center of the intonation contour, which is what it always was in
Trager and Smith (1951), in Hockett’s modification (1958) of their notation,
and in the various other treatments the analysis was given (Hill,
1958; Gleason, 1955; Stockwell, Bowen, and Silva-Fuenzalida, 1956;
Stockwell, 1962; Stockwell and Bowen, 1965). Secondary is [+heavy]
[+accent] [−intonation] – i.e. ‘full articulation with increased respiratory
energy causing a pitch obtrusion’, which is the same as primary except not
at the contour center. Tertiary is [+heavy] [−accent] [−intonation] – i.e.
everything that's left except the reduced vowels. Weak is reduced
vowels (or ‘reduced timing’, since there is not actual centralization in all
instances). One of the Vanderslice-Ladefoged points is to argue
that there are no viable stress contrasts between secondary and tertiary
after the nuclear syllable (‘nuclear syllable’ is another of the many
phrases for ‘center of the intonation contour’): e.g. (3.i, ii), unless
an optional disjuncture is inserted in (3.i) for just this purpose:

3. (i) He saw a black bird, not a green one. [Contrastive]
(ii) He saw a blackbird, not a crow. [Compound]

But although they claim (correctly, in my judgement) that [...]
be used to resolve such [...] levels of
prominence, in all, for English: [...]
operator as (4.i) (= in Trager-Smith notation (ii), [...]
notation (iii)):

4. (i) elevator operator [...]
(ii) élevator operator (TS would write élevator óperátor)
(iii) elevator operator [1434 3434] (CH would write elevator operator [1434 2434])
If we move the contour center to the left (as in They MURDERED the
elevator operator), so that the one-stress on the first syllable is downgraded
to secondary, then there are still two levels of prominence on this
phrase, or any similar one. This is because English allows either full
vowels or reduced vowels at the lowest stress level, as in pairs like typhoon
≠ saloon, buffoon; citation ≠ legation. The distinction can be made as
a function of the ‘heaviness’ of the vowel – i.e. unaccented syllables can be
heavy (have unreduced vowels) or light (have reduced vowels). This
solution to the much debated four-stress v. three-stress question of the
1950s was adopted some years ago by Householder (1957) and by me in
practical work (Stockwell and Bowen, 1965), though until Chomsky and
Halle (1968) no one had stated the crucial rule which determines which
vowels are reducible and which ones must remain unreduced – and it is
THAT insight which is crucial to the Vanderslice and Ladefoged kind of
solution.

The levels of prominence that we need to represent are, then, the following,
given that vowel qualities are also represented (or predictable by
rule):

5. Accented (= center of contour = nuclear stress = Bolinger’s pitch
accent = Trager and Smith primary = Chomsky and Halle one-stress
= IPA strong = sentence stress = Vanderslice and Ladefoged [+intonation,
+accent, +heavy])

Stressed (= non-nuclear accentable syllable = Vanderslice and Ladefoged
[−intonation, +accent, +heavy] = Trager and Smith secondary
= Chomsky and Halle two- and sometimes three-stress = word stress
= IPA medial = Bolinger's (1958b) ‘morphological stress’)

Unstressed (= everybody's weakest stress, universally acknowledged when
the vowel is reduced, sometimes debated in examples like refugee, canteen,
portray, asbestos, typhoon, austere, effigy where the relevant vowels are
not reduced: these are the Vanderslice and Ladefoged [−intonation,
−accent, +heavy] syllables, corresponding to Trager-Smith tertiary
stress)


For the moment I take as established some version (the details are not
important to the remainder of this discussion) of the Vanderslice-Ladefoged
(1971) modification of the Chomsky-Halle (1968) rules for English
stress assignment. I think there are not many issues of great moment
under the representation question, because so much convincing work was
accomplished between Pike (1945) and Vanderslice (1970).
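
The three-way classification listed under (5) can be written out as a small lookup over the Vanderslice-Ladefoged feature triples. What follows is only a sketch: the names `Features` and `prominence` are hypothetical, and the decision to reject any triple not mentioned in the text (for instance [+intonation, −accent]) is an assumption of the sketch, not a claim made by Vanderslice and Ladefoged.

```python
from typing import NamedTuple


class Features(NamedTuple):
    """One syllable described by the three binary features (hypothetical record)."""
    intonation: bool
    accent: bool
    heavy: bool


def prominence(f: Features) -> str:
    """Map a feature triple to one of the prominence levels listed under (5)."""
    # Assumption of this sketch: [+intonation] presupposes [+accent],
    # and [+accent] presupposes [+heavy]; other triples are rejected.
    if (f.intonation and not f.accent) or (f.accent and not f.heavy):
        raise ValueError("feature combination not covered by the classification")
    if f.intonation:
        return "accented"      # center of contour, nuclear stress
    if f.accent:
        return "stressed"      # non-nuclear accentable syllable
    # Unaccented syllables divide by vowel heaviness only.
    if f.heavy:
        return "unstressed (full vowel)"
    return "unstressed (reduced vowel)"
```

For example, `prominence(Features(False, True, True))` gives the level the text calls Stressed, i.e. the Trager-Smith secondary.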


B The nuclear-stress question 

In a recent paper by Joan Bresnan (1971) we have an insight which, if it
is correct, is one of the most persuasive and explanatory insights into
obvious and familiar data that the MIT school has come up with yet.5
Whatever the difficulties that she still faces in making her proposal stick
in detail (and there are several such, both ones that she is aware of and
ones that have been, or shortly will be, pointed out to her by colleagues),
the basic insight is so appealing that like some of Chomsky's first ideas
about the role of transformations in grammar one feels it just has to be
right. The relevant data has been around for so long that it’s interesting to
speculate on the way that scientific insights come about. Clearly, in this
case at least it is not a matter of new data, nor even of a new observation
about the grammatical relations to be found in the data. Stanley Newman
(1946) cited such minimal pairs as


6. (i) I have INSTRUCTIONS to leave 
(ii) I have instructions to LEAVE 


and pointed out that BREAD to eat ‘indicates a syntactic relation in
which the noun is the logical object of the verb: that is, bread to eat has a
relationship with “to eat bread”’, whereas in a desire to EAT, ‘the verb
stands in the relation of complement to the noun’ (p. 179). One need not
translate the preceding into the equivalent transformational jargon to
come up with Bresnan’s insight: namely, that the accentuation of
INSTRUCTIONS in (6.i) and the de-accentuation of to leave occurs
in the deep structure by the regular nuclear stress rule at a stage prior
to the transformation that lifts INSTRUCTIONS out of the lower
sentence and drops it into the object slot of the upper one, into which
position it carries along its accentuation; whereas in (6.ii) LEAVE
never moved away from its naturally accentuated position.

5. There will be some difficulty, for readers not closely acquainted with recent
transformationalist literature, in following the details of the ensuing discussion. I have
tried to state the arguments in such a way that a precise understanding of particular
rules or terms is [...] for the purpose of grasping their general import for the
theory of grammar. [...] paper was conceived as an updating of my earlier views
[...] updating depends on very recent work that has not yet filtered [...]
literature, part of what follows here has the [...] ‘in-house paper’. [...]
and even ignored crucial details; [...] but the in-house language. I can
only [...] not deliberately [...] consequences, and that I believed them
useful and [...] communicate with the world outside.

Bresnan presents three classes of examples: 


7. Relative clause (v. Noun complement)
(i) Mary liked the PROPOSAL that George left.7
v.
(ii) Mary liked the proposal that George LEAVE.8

8. Direct and indirect questions (fronting of stressed Noun v. fronting of
unstressed Pronoun)9
(i) John asked what BOOKS Helen had written.
v.
(ii) John asked what Helen had WRITTEN.
(iii) Helen has WRITTEN something. [Contained in 8.ii]
(iv) What has Helen WRITTEN? [Interrogative of 8.iii]
(v) Helen has written some BOOKS. [Contained in 8.i]
(vi) What BOOKS has Helen written? [Interrogative of 8.v]
(vii) The parable shows what SUFFERING men can create.
v.
(viii) The parable shows what suffering men can CREATE.


6. The NUCLEAR-STRESS RULE (NSR) is a rule of Chomsky and Halle's (1968)
that assigns heightened stress to the rightmost stressable item in a specified domain,
e.g. a phrase or sentence.

7. The phrase the PROPOSAL that George left is a Noun Phrase containing a
restrictive relative clause: roughly ‘a certain proposal which was left somewhere by
George’.

8. The phrase the proposal that George LEAVE is the nominal equivalent of the
sentence Someone proposed that George (should) leave.

9. It is assumed that (8.i) and (8.ii) have, as their underlying abstract form, something
like

[S John asked [S Helen has written WH-some books]]
[S John asked [S Helen has written WH-something]]

In order to derive the surface sentences from this structure, the object of has written,
marked with WH-, must be moved to the front of the clause.




9. Reduced relative clause (v. Noun complement)
(i) Helen left DIRECTIONS for George to follow.
[= ‘directions such that George could follow them’]
v.
(ii) Helen left directions for George to FOLLOW.
[= ‘directions to the effect that George should follow her’]


Bresnan's claim is simply that relative clause formation and question
formation are cyclical rules,10 and that the nuclear-stress rule follows
them IN THE CYCLE (not necessarily immediately – at the end of
the cycle, before last-cyclic and post-cyclic rules and of course before the
next cycle up). This device produces the right results by virtue of the
peculiar way in which the NSR was formulated originally (as far back as
Chomsky-Halle-Lukoff (1956), where it first appeared). The rule doesn't
do what one might intuitively think a contour-center-marking rule ought
to do, namely add a pitch-accent to the item that is singled out for the
one-to-the-customer privilege (where the customer is a ‘phonemic phrase’
– i.e., everything between the two nearest boundaries of sufficient status to
become intonationally-marked ‘pauses’). Rather, the rule subtracts stress
from the other items in the same phrase, and renders them thenceforth
frigid and unresponsive to the possibility of becoming contour centers
themselves. They can be weakened further, by subsequent applications of
the NSR in higher (i.e. later) cycles – a weakening which is vacuous in
terms of its effect on the prosodic qualities of the phonetic output, as
already discussed under the representation question – but since the
NSR is set up in such a way that it will only operate on items that already
have maximum stress, they can't be strengthened and they therefore end
up as destressed remnants to the right of the contour center (as in 7.i, 8.i,
8.vi, 8.vii, and 9.i above).
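
The subtractive behaviour just described can be made concrete in a few lines. This is a minimal sketch, not the Chomsky-Halle formulation itself: it assumes that every word enters with stress 1, that a constituent is a nested Python list, and that higher numerals mean weaker stress. The names `nsr` and `cyclic_nsr` are hypothetical.

```python
def nsr(stresses):
    """One application of the rule to a single domain.

    The rightmost item that still has stress 1 keeps it; every other
    item is weakened by one degree (its numeral goes up by one).
    Items can only lose stress, never regain it.
    """
    ones = [i for i, s in enumerate(stresses) if s == 1]
    if not ones:
        return list(stresses)
    keep = ones[-1]
    return [s if i == keep else s + 1 for i, s in enumerate(stresses)]


def cyclic_nsr(tree):
    """Apply the rule from the innermost bracket outwards.

    `tree` is either a word (str) or a list of sub-trees.
    Returns a flat list of (word, stress) pairs.
    """
    if isinstance(tree, str):
        return [(tree, 1)]  # assumption: every word starts with stress 1
    items = [pair for child in tree for pair in cyclic_nsr(child)]
    new = nsr([s for _, s in items])
    return [(w, s) for (w, _), s in zip(items, new)]
```

With the bracketing `["John", ["saw", "Mary"]]`, the inner cycle weakens saw, and the outer cycle weakens both John and saw, leaving Mary as the only item that was never touched.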

There are a number of aspects of the Chomsky-Halle formulation of the
NSR that deserve comment; but if we ignore details of the rule itself and
focus only on the Bresnan claim that the rule applies cyclically, the substantive
content of her claim is that in normal, neutral intonation patterns
the center of the contour is determined by the sequential order of items in
the deep structure, such that if a stressable item is the right-most one
subject to the NSR, then the stress that it receives is carried with it if it
is moved to the left by subsequent transformations: by placing the NSR
after the cyclic syntactic transformations, Bresnan guarantees that rules
within the cycle can move an item without changing the location of the
contour center, but movement rules of the next cycle up, or post-cyclic
or last-cyclic rules, will change the location of the contour center if
they move that item to which the contour center has been assigned in
the lower cycle.

10. Cyclical rules are, for the present purpose, transformational rules which apply in
sequence to the lowest (most deeply embedded) sentence in a phrase-marker, and then
reapply to the next higher phrase-marker, and so on. Some rules can be shown to be
applicable only in the last or top-most cycle (such as Auxiliary Inversion, which derives
What is HE doing? from (I don't know) what HE is doing), whereas others such as
Passive Formation must be ordered among the cyclical rules.

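
The ordering claim can be illustrated with the question pair in (8). The sketch below is an assumption-laden toy, not Bresnan's formalism: a clause is a flat list of (word, stress) pairs, unstressable items carry `None`, WH-fronting is modelled as moving one item to the front, and the function names (`apply_nsr`, `front`, `contour_center`, `derive_question`) are hypothetical.

```python
from typing import List, Optional, Tuple

Item = Tuple[str, Optional[int]]  # (word, stress); None marks an unstressable item


def apply_nsr(phrase: List[Item]) -> List[Item]:
    """Keep the rightmost 1-stress and weaken every other stressed item by one."""
    ones = [i for i, (_, s) in enumerate(phrase) if s == 1]
    if not ones:
        return list(phrase)
    keep = ones[-1]
    return [
        (w, s if (s is None or i == keep) else s + 1)
        for i, (w, s) in enumerate(phrase)
    ]


def front(phrase: List[Item], word: str) -> List[Item]:
    """Move one item to the front of the clause; it keeps whatever stress it has."""
    moved = [item for item in phrase if item[0] == word]
    rest = [item for item in phrase if item[0] != word]
    return moved + rest


def contour_center(phrase: List[Item]) -> str:
    """The contour center is the one item left with stress 1."""
    return next(w for w, s in phrase if s == 1)


def derive_question(clause: List[Item], wh_item: str) -> List[Item]:
    inner = apply_nsr(clause)        # the rule applies while the object is still final
    fronted = front(inner, wh_item)  # fronting happens afterwards
    return apply_nsr(fronted)        # later application can only weaken further
```

Starting from `[("Helen", 1), ("had", None), ("written", 1), ("what books", 1)]`, the center ends up on the fronted phrase, as in (8.i); replacing the object with an unstressable `("what", None)` leaves the center on written, as in (8.ii).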
Bierwisch (1968) has argued that constituents which are contour centers
in the surface structure ‘must be specially marked for that property at a
very early stage in the syntactic derivation’ (p. 177). His arguments have
to do with anaphoric elements (with reduced stress consequent upon their
anaphoric status within the context) and contrastive stress of various
types. His arguments are therefore of a very different type from Bresnan's,
even though he anticipates her conclusion, in this respect. It is remarkable
how little has been accomplished subsequently in the study of the formal
properties of contrastive stress. There is nothing comparable in detail or
in conviction to the Chomsky-Halle kinds of proposals about the rules
that govern the placement of neutral stress.
Bierwisch does not converge with Bresnan to argue for stress determination
prior to surface structure in the case of neutral (non-contrastive)
intonation patterns, but only for contrastive ones – or perhaps something
more subtle, such as topic/comment marking. Emily Pope (1971), however,
does converge by arguing that some syntactic rules that bring about
deletions must follow rules which assign intonation contours; and the
assignment of intonation contours, as she sees it, in turn depends on stress
assignment. Her argument depends on contrasts like the following:


10. (i) Yes, happily. [= ‘Yes, they are married happily.’]
(ii) Yes, happily. [= ‘Yes, they are married, I'm happy to say.’]


She claims that ‘the process of intonation assignment is a mapping from
surface-structure syntactic information to phonetic interpretations’ (p. [...])
and that it is a phonological phenomenon. I find some of her arguments
less than persuasive, specifically (a) that intonation assignment depends
on prior stress assignment, and (b) that intonation assignment rules ‘take
into account brackets but not labels’ (p. 73), in that respect resembling the
NSR, which she assumes to be a phonological rule par excellence. In
respect to (a), I merely note that the only way in which intonation depends
on prior stress assignment is for determination of the center of the contour;
but what contour turns up (e.g. rising v. falling) in no way depends on
the location of the center. It is quite likely that the contour, and its center,
are altogether independent phenomena; the center perhaps depends on
such factors as the NSR, emphatic stress marking, contrastive stress, or
topic/comment marking; and the contour itself depends on factors
sometimes very remote from the surface, such as degree of conviction
with which an asserted belief is held (‘He [...]’) and at other times
fairly close to the surface and obvious within the derivation, such as the
yes/no interrogative rising intonation (which can be argued to depend
either on the presence of some sort of trigger in the deep structure, or a
deletable performative, or some special configuration such as WH-either/or,
roughly ‘WH-either he is going, or he is not going’ as source of
‘Is he going?’). The point is that Pope does not establish anything, in
respect to the relation of intonation to the rules of stress assignment,
beyond the claim that there is a one-to-one correlation between main
stress and the center of an intonation contour: a fact which was not in
dispute. There is ample evidence (e.g. Bolinger, 1958a) that the only way
main stress is perceived is by virtue of the pitch perturbation that defines
the center of the contour. In respect to (b), that intonation assignment
rules operate only on brackets, ignoring labels, I follow Bierwisch (1966)
in large part, though I believe but cannot demonstrate here in detail that
the rules of optional phrasing – those rules which specify the location and
form of intonational pauses that are not absolutely obligatory – require
category labels to bring about correct downgrading to the status of ‘clitic’.
It will not be an absolute downgrading, but a relative and hierarchical one.


For instance: 


11. (i) I want to know how she BUILT it. 
(ii) [Slower] I WANT to know HOW she BUILT it. 


12. (i) I saw how quickly she BUILT it.
(ii) [Slower] I SAW how QUICKLY she BUILT it. 


I take it that how in both (11) and (12) has the same node label above it:
for (11) there is a deep structure containing ‘She built it in some manner’
and in (12) there is a deep structure containing ‘She built it in a quick
manner’. In both cases, WH-attachment to the ‘manner’ adverbial results
in the form how. In both, a slowing down of the sentence introduces
optional pauses and thereby two more intonation contours than appear
in the corresponding faster version. Optional phrasing of this sort, as
Bierwisch (1966) has demonstrated, is a highly regular phenomenon. It
operates on the general principle that pauses must be introduced between
higher ranking constituents before they are introduced between lower
ranking ones. The principle has the important qualification that you ignore
the ranking of any constituent that has been attached as a clitic, intonationally,
to some other constituent. Thus in (11) and (12), the subject
pronoun is attached as a clitic to the following verb. Therefore the slower
version of the sentences does not introduce pause between the two highest
constituents. Pronouns always look for a prop to support them. They
are stressable only when the prop has been removed, or when they are
contiguous with even less able-bodied categories (like prepositions or
conjunctions), as in the phrase between you and me.

Returning now to (11.ii) and (12.ii), and considering examples like 
between you and me at the same time, it would appear to be impossible to 
state the conditions of optional phrasing without reference to category 
labels. Bierwisch has no examples of this type, and I am unable to make his 
rules (which make no reference to category labels) serve for these examples. 

Pope's paper, then, sets out to show that there are phonological processes,
namely intonation assignment, which must precede some syntactic
transformations. If correct, this would provide another case like Bresnan’s.
Pope’s claim that intonation assignment precedes some syntactic rules
seems to be correct up to a point. There is no conceivable way in which the
intonational contrast between (10.i) and (10.ii) could be assigned on the
basis of surface syntactic information alone. The surface syntactic information
is presumably identical. It follows either that intonation assignment
is not a purely phonological process, or that in the course of the
derivation of (10.i) and (10.ii) some tag is left behind (when the deletion
occurs) to identify the contrast, or that intonation assignment is a phonological
rule that applies before some syntactic rules (Pope’s view). I think
Pope has chosen the least persuasive of the three possible consequences of
her evidence. I myself think the first alternative is correct. Some of the
current leading MIT linguists like Postal, Ross and Lakoff are more likely
to go along with the second alternative (of which a variant would be a
so-called global or trans-derivational constraint). But I have no evidence
to provide, yet, that would choose between the alternatives.

Even if Pope's arguments do not establish the position that phonological
rules can be interspersed with syntactic ones, Bresnan's case, if solid,
would do so. But her case is in fact a rather spongy one. Her evidence turns
out to be either wrong, or internally so inconsistent that one has to reject
any conclusion based on it.

In Kingdon (1958, p. 205) there are pairs like these: 


13. (i) Introduce me to the man you were TALKING to.
(ii) I'll lend you that BOOK I was talking about.

One can dream up indefinitely many examples like (13.i) in which the
non-contrastive contour center location is on the last accentable item of
the relative clause.11 All of them should be instances of contrastive stress,
under Bresnan’s hypothesis (cf. (7) above). George Lakoff (1972) has noted
comparable examples (p. 286):

14. (i) Teddy is the only capitalist I would ever VOTE for.
(ii) Teddy is the only CAPITALIST I would ever vote for.

(14.i) is an especially persuasive counterexample because capitalist is not
an obvious candidate for anaphora (and therefore unlikely to be destressed).
Yet it is clear that (14.ii), not (14.i), is contrastive: it implies that the speaker
would vote for any non-capitalist.

Counterexamples like (13) and (14), where Bresnan's hypothesis
wrongly predicts that the last NP in the relative clause will remain the
contour center after it is fronted, are reinforced by counterexamples to her
second class of cases, the direct and indirect questions (8):


15. (i) He works for a chain of GROCERY stores.
What chain of grocery stores does he WORK for?
I asked what chain of grocery stores he WORKED for.

(ii) He established a new TRADITION.
What new tradition did he ESTABLISH?
or
What new TRADITION did he establish?

(iii) He bought a new dress for someone's WIFE.
Whose wife did he buy a new DRESS for?


It appears, in fact, that only a direct object allows a non-contrastive reading
when it carries the contour center forward – and even that is not always
obligatory, as in (15.ii). In (15.i) and (15.iii), the object of the preposition,
even though at contour center when final, clearly cannot carry the contour
center to the left, except contrastively. Lakoff (1972) has many examples
like (15), and he has devised a particularly clever example which demonstrates
that if an NP can be read ambiguously as direct object or prepositional
object (within, e.g. a benefactive adverb), the direct object reading
is preferred if the NP is fronted along with its stress, whereas the other
reading is preferred if the contour center is retained on the final accentable
item:

11. Bresnan's paper was the central subject of my seminar on English intonation in
the winter quarter of 1970-71. During that seminar, my students – in particular Carol
Lord – and I discovered these and other counterexamples. At the same time, and
independently, George Lakoff in Michigan was writing the paper to which some of the
following discussion is devoted. The degree of our convergence is apparent below. I
am most grateful to him for his willingness to make the paper available to me prior to
publication.

16. (i) The men are competing for some countries.
[ambiguous: the countries may be their potential awards, or merely their
sponsors]
(ii) What countries are the men COMPETING for?
[= Who is sponsoring the men?]
(iii) What COUNTRIES are the men competing for?
[= What countries make up the list of prizes?]

A similar, but less natural, example had occurred to us:

17. (i) The professor looked over a book.
[ambiguous: he glanced through it, or it was interrupting his direct line of
vision]
(ii) What book did the professor look OVER?
(iii) What BOOK did the professor look over?


Such examples establish beyond any doubt that Bresnan's claim is too
broad: it is not the case that final contour-center nouns carry the contour
center to the left with them when they get fronted. They do so only if they
are direct objects – and even then not always. Lakoff points to such
examples as (18):


18. (i) Whose UMBRELLA have I taken?
[Predicted correctly by Bresnan's hypothesis]
(ii) Whose book did the reviewer CRITICIZE?
[Not correctly predicted; somehow the heavier verb affects the decision]
(iii) Which CAR did he buy?
[Predicted correctly if we ignore Bresnan's own statement that NPs with
which are not supposed to follow her prediction.]
(iv) Which car did the timid little clerk who works in our office BUY?
[I think that the neutral contour would have its center on office, which is
what Bresnan would predict if you ignore her qualification about which,
but Lakoff marks it on buy.]

Lakoff does not have any new light to cast on Bresnan's third class of
examples (9). He notes that J. R. Ross in a class at MIT in 1967 made the
correct observation about (19),

19. (i) John has PLANS to leave.
(ii) John has plans to LEAVE.




that in (19.i) plans is underlying direct object of leave, and that this fact
somehow accounts for its stress. But this observation goes back at least
to Newman (1946), as Ross was no doubt well aware even though it is not
mentioned by Lakoff.

Lakoff's own solution (Lakoff, 1972) is apparently adequate to the
evidence, though unsatisfying because it merely lists a set of curious facts
within a global constraint12 and provides no explanatory account of them.
The constraint does, however, block a class of counterexamples to Bresnan's
hypothesis which we have not dealt with so far, and which Lakoff was the
first to observe:

20. (i) It is likely that he'll solve those PROBLEMS.
(ii) *Which PROBLEMS is it likely that he'll solve?
(iii) Which problems is it likely that he'll SOLVE?
[Bresnan introduces an irrelevancy here: she excludes examples with which
from her predictions. But the example is just as damaging with what: ‘What
problems is it likely that he'll SOLVE?’]


The Lakoff global constraint sets up three conditions under which an NP
will be allowed to carry its contour-center-hood forward with it:

21. (i) In logical structure it is a direct object;13
(ii) In shallow structure it has no clause-mates following it;14 and
(iii) In surface structure it is a clause-mate of its logical predicate.

It is condition (21.iii) which blocks (20.ii). Condition (21.ii) merely guarantees
that it is final (and therefore subject to the NSR). And condition (21.i)
is the fundamental condition that distinguishes those examples of Bresnan's
which are valid from the classes of counterexamples cited in (13)-(18)
above. It is also the condition that seems quite ad hoc and non-explanatory
to me, though I have nothing better to offer.
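
Because the constraint in (21) is a plain conjunction of three conditions, each stated at a different level of the derivation, it can be written as a single predicate. This is only a sketch: the record `FrontedNP`, its field names, and the way the examples are encoded below are assumptions made for illustration, not Lakoff's notation.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FrontedNP:
    """What the constraint needs to know about one NP (hypothetical record)."""
    logical_direct_object: bool             # (21.i), checked in logical structure
    followed_by_clause_mates: bool          # (21.ii), checked in shallow structure
    surface_clause_mate_of_predicate: bool  # (21.iii), checked in surface structure


def may_carry_contour_center(np: FrontedNP) -> bool:
    """True only if all three conditions of (21) hold at once."""
    return (
        np.logical_direct_object
        and not np.followed_by_clause_mates
        and np.surface_clause_mate_of_predicate
    )


# Encodings of three examples from the text (the field values are my reading of them).
umbrella_18i = FrontedNP(True, False, True)      # Whose UMBRELLA have I taken?
grocery_15i = FrontedNP(False, False, True)      # object of a preposition
problems_20ii = FrontedNP(True, False, False)    # fronted out of its own clause
```

The predicate looks at three different stages of one derivation at the same time, which is exactly what makes the constraint ‘global’ rather than a condition on any single rule.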
12. A global constraint is one which applies across two or more stages of a derivation;
i.e. one which is not statable as a constraint on the operation of a particular
transformational rule, but must hold across several stages, even across intervening
rules to which it is irrelevant. Since a constraint of this form enormously enriches the
power of a grammatical theory (and thereby weakens the claims the theory can make),
one allows it only in the face of compelling evidence.

13. By ‘logical structure’ what is meant is ‘deepest, most abstract representation –
hopefully corresponding to the logical semantic structure’.

14. ‘Shallow structure’ is that level of abstract representation that exists after all but
last-cyclic and post-cyclic rules have applied. ‘Clause-mate’ has the apparent sense,
namely ‘constituent in the same clause’.

Before leaving the nuclear-stress question, we should look again at
examples like (6), (9) and (19):

6. (i) I have INSTRUCTIONS to leave.
(ii) I have instructions to LEAVE.

9. (i) Helen left DIRECTIONS for George to follow.
(ii) Helen left directions for George to FOLLOW.

19. (i) John has PLANS to leave.
(ii) John has plans to LEAVE.


Unlike the other classes of examples, there is no quibbling about these. 
Furthermore, the Lakoff constraint (21.ii) does not hold for this class: 


22. (i) I have INSTRUCTIONS to leave with Mary.
(ii) I have INSTRUCTIONS to leave on the airplane.
(iii) John has PLANS to leave here this afternoon. 


And of course, since Lakoff's constraint (21.i) merely formalizes the
distributional fact which would allow Bresnan to apply the NSR to it, it
follows that these examples are somewhat mysterious under either the
Bresnan hypothesis or the Lakoff global constraint. Lakoff's constraint
(21.i), that the contour center must be a direct object in logical structure,
would appear to relate these examples to the ordinary relative clause and
interrogative examples. But why does this class, alone, ignore clause-mates
following it, any of which can be contour centers in other forms
of the sentences:

23. (i) He left the instructions with MARY.
(cf. 22.i)
(ii) He left the instructions on the AIRPLANE.
(cf. 22.ii)
(iii) He left the plans here this AFTERNOON.


I can offer some weak evidence that the contour center of (6.i), (9.i) and
(19.i) has nothing at all to do with having been an object of the lower verb
in logical structure (and therefore nothing to do with cyclical application
of the NSR). Consider sentences closely related to (9):

24. (i) Helen left DIRECTIONS, and George is to FOLLOW them.
(ii) Helen left DIRECTIONS. George can FOLLOW them (if he wants
to).
(iii) Helen left DIRECTIONS (which George can follow if he wants to).

I think (24.iii) is closest to exemplifying my proposal: the [...]
in (9) is the remnant of a fuller parenthesis: the entire parenthesis in (24.iii)
would normally receive low pitch throughout – I think it is a separate
contour with follow at the center, marked not necessarily by pitch obtrusion
but by timing. If there are necessarily two contours in such sentences,
directions would form the center of the first one (by the ordinary NSR),
and the pitch drop would be a consequence of a parenthesis rule that is
needed anyway – which may then, when truncated sufficiently, as in ...
PLANS to leave, appear to be a destressed final segment of a single
contour.


C The boundary question 

Without pretending to do justice to a long and excellent dissertation 
(Downing, 1970), I can outline a hypothesis that has a great deal of 
generality going for it. The interest it has in relation to Bresnan and Pope 
is apparent from the following quotation (p. 204): 

ly phonological effects, they 
ate transformational rules of 
ncluded that independently 
fficient for the operation 
e are determined 
logical rules that 


Although phonological phrase boundaries have on 
Must be assigned prior to the application of certain | 
the syntactic component. Therefore it must be со 
Motivated aspects of syntactic surface structure are not su 
9f phonological rules: some aspects at least of surface structuri 
exclusively by the necessity of providing input to the phonol 
Specify prosodic features . . - 

Thus Downing adds to the clamor of Bresnan and Pope that intonation
cannot be assigned from surface structure alone. He takes as the basis for
his own position a claim of Emonds (1970) that ‘a characteristic of root
sentences is to be set off by commas’ (8). Emonds’s notion ‘root sentence’
is somewhat redefined by Downing, because Emonds’s definition would
include extraposed clauses, as in (25):


25. (i) It bothered him that she was intelligent. 
(ii) [Tree diagram for (25.i), with node labels S, NP and VP]
If root sentences are to be set off by commas (i.e. are to receive a separate
intonation contour), (25) must be excluded. Downing’s definition, then, is
that a root sentence is not commanded by a VP node.¹⁵ Given this definition,
he inserts phrase boundaries, which indicate where the intonation
contour will start and stop, at both ends of every root sentence. The rule
that inserts these boundaries applies after the cyclic rules and before certain


15. The notion ‘command’ in this context means only that the given sentence is not
dominated by a VP node nor a sister of a VP node.


post-cyclic rules. It works without any ad hoc quality to explain the
intonational pauses in conjoined sentences:


26. (i) John bought the candy/and Mary ate it.
(ii) I told you that John bought the candy and Mary ate it. 


In (26.i) the comma pause is (pretty much) obligatory, a fact which is
explained by Downing's hypothesis, since the conjunction joins two root
sentences. In (26.ii) no comma pause occurs, because the conjoined sentences
embedded as object of tell are not root sentences. This is Downing's
crucial observation; he then looks at other instances of obligatory pauses
and tries to make them all fit the same hypothesis – a reasonable scientific
procedure. One always wants to be absolutely forced by one's data to add
any more machinery to the shop-full that one already has. In this
case, the procedure leads down an increasingly rocky and hazardous path,
however, and as he gets further away from basic conjunction, he becomes
less and less persuasive.
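
Downing's root-sentence test, as glossed in note 15, lends itself to a small sketch. The Python below is an illustration only: the `Node` class, the helper names and the deliberately flat constituent structures assigned to (26.i) and (26.ii) are assumptions made here, not Downing's own formalization. It treats an S as a root sentence when it is neither dominated by a VP nor the sister of a VP.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Node:
    """A constituent; leaves carry a word, other nodes carry children."""
    label: str
    children: List["Node"] = field(default_factory=list)
    word: Optional[str] = None
    parent: Optional["Node"] = field(default=None, repr=False, compare=False)

    def __post_init__(self) -> None:
        # Record the mother of each daughter so ancestors can be inspected.
        for child in self.children:
            child.parent = self


def leaf(label: str, word: str) -> Node:
    return Node(label, word=word)


def clause(subject: str, *predicate: str) -> Node:
    """Build a toy S -> NP VP clause (hypothetical structure)."""
    np = Node("NP", [leaf("N", subject)])
    vp = Node("VP", [leaf("X", w) for w in predicate])
    return Node("S", [np, vp])


def words(node: Node) -> List[str]:
    if node.word is not None:
        return [node.word]
    out: List[str] = []
    for child in node.children:
        out.extend(words(child))
    return out


def is_root_sentence(node: Node) -> bool:
    """True for an S neither dominated by a VP nor sister to a VP."""
    if node.label != "S":
        return False
    if node.parent is not None:
        if any(s is not node and s.label == "VP" for s in node.parent.children):
            return False
    ancestor = node.parent
    while ancestor is not None:
        if ancestor.label == "VP":
            return False
        ancestor = ancestor.parent
    return True


def root_sentences(tree: Node) -> List[str]:
    """Collect the word strings of all root sentences, outermost first."""
    found = [" ".join(words(tree))] if is_root_sentence(tree) else []
    for child in tree.children:
        found.extend(root_sentences(child))
    return found


def conjoined() -> Node:
    return Node("S", [clause("John", "bought", "the", "candy"),
                      leaf("CONJ", "and"),
                      clause("Mary", "ate", "it")])


ex_26i = conjoined()
ex_26ii = Node("S", [Node("NP", [leaf("N", "I")]),
                     Node("VP", [leaf("V", "told"), leaf("N", "you"),
                                 leaf("COMP", "that"), conjoined()])])

print(root_sentences(ex_26i))
print(root_sentences(ex_26ii))
```

On these toy structures the two conjuncts of (26.i) each count as root sentences, while in (26.ii) only the matrix sentence does, which parallels the contrast in comma pauses described above.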


The case beyond conjunction that looks best is adverb preposing, as in
(27) (from Downing, 1970, p. 83):
27. (i) When John phones, the girls talk to him.
(ii) When John phones the girls, talk to him.
(iii) When John phones the girls talk to him.
(27.iii) is thrown in only to show that the comma pause, either way,
is obligatory. But the natural order of these adverbs is final, and a pause is
not obligatory:


28. (i) The girls talk to him when John phones.
The girls talk to John when he phones.

(ii) Talk to John when he phones the girls.
(?)Talk to him when John phones the girls.


But hold a moment: is it the case that any of the sentences of (28) can be
spoken naturally with only one intonation contour? Downing thinks so; I
think it is possible that the sentences are all two-contour sentences,
the second contour being a low-level one:


29. (i) The girls TALK to John / when he PHONES.

(ii) TALK to John / when he phones the GIRLS.




If this is a correct observation – we will examine more data below – notice
what would follow: there would be no need for insertion of phrase 
boundaries when the adverbial sentence is fronted. Each intonation pattern 
has a boundary in (29): what happens is that the intonation pattern itself 
is changed by the fronting rule, and Downing has interpreted this as 
boundary insertion: 


30. (i) When he PHONES / the girls TALK to John.

(ii) When he phones the GIRLS / TALK to John.


We now have two questions: (a) by what device does Downing insert
phrase boundaries when adverbial sentences are fronted (assuming that
they do not have separate intonation patterns when they are not fronted)?
(b) what kind of evidence will decide whether such adverbs have separate
intonation patterns in their pristine (unfronted) state?
Downing's device is to formulate all the relevant transformations as
attaching the fronted element by means of Chomsky adjunction rather than
sister adjunction.¹⁶ Thus:


31. (i) [Tree diagram: S branching into NP and VP, with an ADV node over
‘when he phones’; terminal string ‘The girls talk to John when he phones’]


16. One of the central problems of ‘classical’ transformational theory is that of
assigning a correct structure to the output of a transformational rule. In ‘sister
adjunction’ a node is attached to the left or right of some daughter node (hence the term
‘sister’). Thus in (31.i), sister adjunction would yield

[S [ADV [S when he phones]] [NP the girls] [VP talk to John]]

with the preposed adverb adjoined as left sister of NP. Chomsky adjunction, on the
other hand, creates an additional node by COPYING the node which already dominates
the node to which the moved item would be adjoined by sister adjunction, as shown in
(31.ii), thus yielding an extra ‘layer’ of structure.


(ii) [S [ADV When he phones] [S [NP the girls] [VP talk to John]]]


Quite simply, by Chomsky adjunction he turns a non-root sentence into a
root sentence. He is of course honest about the ad hoc character of the
device (p. 205):


‘It is not possible to predict which particular transformations will employ
Chomsky adjunction; rather it is necessary to specify Chomsky adjunction,
sister adjunction, etc., as part of the structural change of each
particular movement transformation. It appears in fact that individuals may
employ different types of adjunction (as revealed in phrasing) in what is
essentially the same transformation, e.g. in adverb preposing.’


It appears to me that the device is then circular as well as ad hoc: if you
know from the surface output that you need a separate intonation pattern,
set up the relevant rules with Chomsky adjunction. This is equivalent to
assigning the separate intonation pattern in the relevant transformation
itself. Chomsky adjunction is the least well-motivated of the several types
of elementary transformations, anyway. If intonational facts are used to
decide when it is needed, and if it is employed to explain intonational
facts, the circle is complete and unconvincing.
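
The difference between the two kinds of adjunction described in note 16 can be made concrete with a minimal sketch. The nested-list encoding of trees and the function names below are assumptions made purely for illustration; only the two operations themselves come from the text.

```python
def sister_adjoin_left(target: list, moved: list) -> list:
    """Attach `moved` as the leftmost daughter of `target` (no new node)."""
    label, *daughters = target
    return [label, moved, *daughters]


def chomsky_adjoin_left(target: list, moved: list) -> list:
    """Create a copy of the label of `target` as a new mother node
    dominating both `moved` and the original `target`."""
    return [target[0], moved, target]


def count_label(tree, label: str) -> int:
    """Count the nodes in `tree` that bear `label`."""
    if isinstance(tree, str):  # a terminal string, not a node
        return 0
    own = 1 if tree[0] == label else 0
    return own + sum(count_label(child, label) for child in tree[1:])


# Trees are encoded as [label, daughter, daughter, ...]; strings are words.
main_clause = ["S", ["NP", "the girls"], ["VP", "talk to John"]]
adverbial = ["ADV", ["S", "when he phones"]]

by_sister = sister_adjoin_left(main_clause, adverbial)
by_chomsky = chomsky_adjoin_left(main_clause, adverbial)

print(by_sister)
print(by_chomsky)
```

The only structural difference between the two outputs is the extra S node that Chomsky adjunction introduces, which is the ‘layer’ on which the root-sentence status of the main clause is made to depend.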
Let us look at some unfronted adverbial sentences: 


32. (i) When he had finished this TASK / he locked up and went HOME
(Downing 1970, p. 53; intonation supplied).
(ii) He locked up and went HOME / when he had finished this TASK.

33. (i) Since you are an old friend of the FAMILY / you have a right to KNOW
(p. 53, intonation supplied).
(ii) You have a right to KNOW / since you are an old friend of the FAMILY.

34. (i) Just as she fired the PISTOL / Bill came into the ROOM.
(ii) Bill came into the ROOM / just as she fired the PISTOL.

35. (i) ‘Am I,’ Hilda SAID, ‘PREGNANT?’
(ii) Hilda SAID, ‘Am I PREGNANT?’


These have been chosen to illustrate the following claims of mine: (a) that
it is utter nonsense to suppose (with Chomsky-Halle, 1968) that the NSR
can go right on applying cyclically and reducing the non-main-stressed
items further and further – the limit of phrase length is rather narrow (the
same point is made by Bierwisch (1968), though he apparently would go
much further down the road of 1-2-3-4-5-6-7 stress reduction than I would,
and he says nothing about the boundary limitation problem that is
inextricably tied to the NSR-repetition problem); (b) that pitch-lowering (of
the whole embedded contour) is obligatory with parenthetical items like
(35.i), and this fact is not expressible as a function of phrase-boundary-
insertion conventions;¹⁷ (c) that the number of intonation contours, if you
grant me level contours in (32) and (33), corresponds, in this class of
examples, to the number of un-deformed deep sentences.

Of course, if (c) is correct, and can be extended (e.g. by rules which
erase intonation patterns only under specified conditions with the
transformations themselves, so that transformational rules do much the same
thing in respect to intonation that they do in respect to other aspects of
structure, namely reduce depth and eliminate structure in various ways –
but not build or add structure), then one of my first hypotheses would be
partially regenerated (see Stockwell, 1960). But a much wider range of
cases has to be examined before such a claim can be supported, and I shall
not do so here: I seek only to cast doubt on the kind of approach that
Downing espouses, and encourage research in a less ad hoc direction.


It should be clear by now that we have not achieved a coherent theory
of intonation in relation to syntax yet. But a great deal of interesting
progress has been made, and intonation is, after several years of neglect,
suddenly quite central again in syntactic discussions.


17. This observation has been made in the traditional literature on intonation
repeatedly, though I owe it most recently to Peter Ladefoged in the seminar
noted earlier.




References

BERMAN, A., and SZAMOSI, M. (1972), ‘Observations on sentential stress’, Language,
  vol. 48, pp. 304-25.
BIERWISCH, M. (1966), ‘Regeln für die Intonation deutscher Sätze’, Studia
  Grammatica, vol. 7, pp. 99-201.
BIERWISCH, M. (1968), ‘Two critical problems in accent rules’, J. Linguistics,
  vol. 4, pp. 173-6.
BOLINGER, D. L. (1958a), ‘A theory of pitch accent in English’, Word,
  vol. 14, pp. 109-49.
BOLINGER, D. L. (1958b), ‘Stress and information’, American Speech, vol. 33,
  pp. 5-20.
BOLINGER, D. L. (1972), ‘Accent is predictable (if you're a mind-reader)’,
  Language, vol. 48, pp. 633-44.
BOLINGER, D. L., and GERSTMAN, L. J. (1957), ‘Disjuncture as a cue to
  constructs’, Word, vol. 13, pp. 246-55.
BRESNAN, J. W. (1971), ‘Sentence stress and syntactic transformations’,
  Language, vol. 47, no. 2, pp. 257-81.
BRESNAN, J. W. (1972), ‘Stress and syntax: a reply’, Language, vol. 48, pp. 326-42.
CHOMSKY, N. A., and HALLE, M. (1968), The Sound Pattern of English,
  Harper & Row.
CHOMSKY, N. A., and LUKOFF, F. (1956), ‘On accent and juncture in English’,
  in M. Halle (ed.), For Roman Jakobson, Mouton.
DOWNING, B. T. (1970), ‘Syntactic structure and phonological phrasing in English’,
  University of Texas dissertation, unpublished.
EMONDS, J. E. (1970), ‘Root and structure-preserving transformations’, MIT
  dissertation, unpublished.
FODOR, J. A., and KATZ, J. J. (1964), The Structure of Language,
  Prentice-Hall.
GLEASON, H. A. (1955), An Introduction to Descriptive Linguistics, Holt,
  Rinehart & Winston.
HILL, A. A. (1958), Introduction to Linguistic Structures: From Sound to
  Sentence in English, Harcourt Brace Jovanovich.
HILL, A. A. (1962), Proceedings of the Third Texas Conference on Problems of
  Linguistic Analysis in English, 1958, University of Texas Press.
HOCKETT, C. F. (1958), A Course in Modern Linguistics, Macmillan.
HOUSEHOLDER, F. W. (1957), ‘Accent, juncture, intonation, and my grandfather's
  reader’, Word, vol. 13, pp. 234-45.
KINGDON, R. (1958), The Groundwork of English Intonation, Longmans.
LAKOFF, G. (1972), ‘The global nature of the nuclear stress rule’, Language,
  vol. 48.
LIEBERMAN, P. (1965), ‘On the acoustic basis of the perception of intonation by
  linguists’, Word, vol. 21, pp. 40-54.
LIEBERMAN, P. (1967), Intonation, Perception and Language, MIT Press.
NEWMAN, S. (1946), ‘On the stress system of English’, Word, vol. 2, pp. 171-87.
PIKE, K. L. (1945), The Intonation of American English, University of Michigan Press.
POPE, E. (1971), ‘Answers to yes-no questions’, Linguistic Inquiry, vol. 2.
STOCKWELL, R. P. (1960), ‘The place of intonation in a generative grammar of
  English’, Language, vol. 36, pp. 360-67.
STOCKWELL, R. P. (1962), ‘On the analysis of English intonation’, Proceedings of
  the Second Texas Conference on Problems of Linguistic Analysis in English, ed.
  A. A. Hill, University of Texas Press.
STOCKWELL, R. P., and BOWEN, J. D. (1965), The Sounds of English and Spanish,
  University of Chicago Press.
STOCKWELL, R. P., BOWEN, J. D., and SILVA-FUENZALIDA, I. (1956), ‘Spanish
  juncture and intonation’, Language, vol. 32, pp. 641-65.
TRAGER, G. L., and SMITH, H. L. (1951), ‘An outline of English structure’,
  Studies in Linguistics, Occasional Paper No. 1.
VANDERSLICE, R. (1970), ‘Occam's razor and the so-called stress cycle’, Language
  Sciences, vol. 13, pp. 9-15 (Indiana University Research Center for the Language
  Sciences).
VANDERSLICE, R., and LADEFOGED, P. (1971), ‘Binary suprasegmental
  features’, UCLA Working Papers in Phonetics, vol. 17, pp. 6-24.


6 David Crystal


The Intonation System of English 


Adapted from David Crystal, Prosodic Systems and Intonation in English, 
Cambridge University Press, 1969, pp. 195-252. 


The parametric approach 

Intonation is viewed, not as a single system of contours, levels, etc., but as
a complex of features from different prosodic systems. These vary in their
relevance, but the most central are tone, pitch range and loudness, with
rhythmicality and tempo closely related. Scholars have been anxious to
restrict the formal definition of intonation to pitch movement alone
(though occasionally allowing in stress variation as well); but when the
question of intonational meanings is raised, then criteria other than pitch
are readily referred to as being part of the basis of a semantic effect. This
is a theoretically undesirable situation; either one adopts a relatively narrow
definition of the phenomenon, and simplifies the formal description of
intonation at the expense of the semantic, or one allows intonation a
wider definition, with resultant increasing complexity in the formal stage
but an ultimately less involved semantic statement. The parametric
approach in principle follows the latter course, but tries to do justice to the
former by giving priority to those prosodic systems involving pitch
movement (namely, tone and pitch range); in this way, one does not exclude
features from other systems when these are made use of, along with pitch,
to produce a given grammatical, accentual or attitudinal effect. Intonation,
then, refers to a phenomenon which has a very clear centre of pitch
contrast, and a periphery of reinforcing (and occasionally contradicting)
contrasts of a different order. The point at which pitch contrast is
completely subordinated to vocal or non-vocal effects of a different nature
is the point at which intonation gives way to other communicational
systems.

It has long been realized that, within the prosodic contrasts of English,
some features are more noticeable and seem to carry more semantic
‘weight’ than others.

Some intonational categories are perceptually more distinct or
linguistically more replicable than others, and this gradation seems to
correlate with degrees of linguistic importance. It was shown (Quirk and
Crystal, 1966) that, when native speakers were presented with the task of




repeating an utterance, there was maximum agreement (84.8 per cent) over
the location of tone-unit boundaries; agreement over tonicity (the placing
of the nucleus within the tone group) was 81.6 per cent; onset location (the
first prominent point in the tone unit) yielded an agreement of 77.3 per cent;
and the exponent of nucleus (the nucleus syllable) an agreement of 74.4 per
cent. Within the category of tone (the pitch movement of the tone group,
not just the nucleus), it was clear that the polarity was most extreme
between fall and rise, i.e. the distinction between these had clearest
phonological status, and that the remaining nuclei tended to cluster into two
groups, depending on whether they were more fall-like or rise-like. Rise-
fall seemed to relate primarily to falling-type tones; fall-rise and fall-plus-
rise to rising-type. Generalizing from these and other results, I would
postulate a major division of nuclear tones into two types, falling
(comprising simple, complex and compound tones, the final direction of pitch
movement being downward in each case) and rising (again comprising
simple, complex and compound tones, but the final direction of pitch
movement being upward). The category of level tone retains an ambiguous
status and must be discussed separately. Finally, from the ways in which
native speakers reacted differently to the utterance they had as a model –
particularly from their misidentifications and substitutions, which showed
significant consistency – it becomes evident that what we are dealing with
in intonational analysis is not a single system of contrasts increasing in
delicacy until all contrasts are accounted for, but a ‘system of systems’,
interacting in different ways, in different degrees at different places within
the tone unit.

One thing emerges quite clearly: the most readily perceivable, recurrent,
maximal functional unit to which linguistic meanings can be attached (in
the present state of our knowledge) is the tone unit. It is the obvious place
to start any examination of the English intonation system.


The tone unit
To analyse English speech into a sequence of non-overlapping tone units
means in effect to define their boundaries. In English there seem to be
regular definable phonological boundaries for tone units in normal speech.
Given that each tone unit will have one peak of prominence in the form
of a nuclear pitch movement (as explained below), then after this nuclear
tone there will be a tone-unit boundary which is indicated by two phonetic
factors. First, there will be a perceivable pitch change, either stepping up
or stepping down, depending on the direction of the nuclear tone movement:
if falling, then step-up; if rising, then step-down; if level, either, depending
on its relative height. This is due to the fact that the onset of each tone unit
in a speaker’s utterance is at more or less the same pitch level. The second
criterion is the presence of junctural features at the end of every tone unit.
This usually takes the form of a very slight pause, but there are frequently
accompanying segmental phonetic modifications (variations in length,
aspiration, etc.) which reinforce this. These phonological criteria suffice to
indicate unambiguously where a tone-unit boundary should go in
connected speech in the vast majority of cases.

There is a general agreement about the internal structure of the tone
unit in English. Minimally, a tone unit must consist of a syllable, and this
syllable must carry a glide of a particular kind. This is the obligatory
element, and is usually referred to (in the British tradition) as the nucleus
of the tone unit. The presence of a nucleus is what accounts for our
intuition of ‘completeness’ at the end of the unit: if it is omitted, the auditory
effect is one of ‘being cut short’. Maximally, the tone unit may consist of
three other segments: the head, the pre-head and the tail.
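
The first boundary criterion described above (a pitch step whose direction follows from the roughly constant onset level) can be sketched in a few lines. The numeric pitch values and the function name are hypothetical: the text gives no measurements, and the treatment of level tones simply applies the same comparison, which is an assumption made here.

```python
def boundary_step(nucleus_end: float, next_onset: float) -> str:
    """Direction of the pitch step from the end of one tone unit
    to the onset of the next."""
    if next_onset > nucleus_end:
        return "step-up"
    if next_onset < nucleus_end:
        return "step-down"
    return "no step"


ONSET = 100.0  # assumed constant onset level for one speaker (arbitrary units)

# Assumed end-points: a fall ends below the onset level, a rise above it;
# a level tone may lie on either side, depending on its relative height.
examples = {
    "falling": 70.0,
    "rising": 130.0,
    "low level": 80.0,
    "high level": 120.0,
}

for tone, end in examples.items():
    print(tone, "->", boundary_step(end, ONSET))
```

The comparison makes explicit why a constant onset level yields a step-up after a fall and a step-down after a rise, and why a level tone can go either way.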
The head of the unit refers to the stretch of utterance extending from
the first stressed and usually pitch-prominent syllable (or onset) up to,
but not including, the nuclear tone. It consists of an unspecified number
of stressed and unstressed syllables (at least one of the former).

The prehead, or pre-onset, refers to any utterance which precedes the
onset syllable within the same tone unit. It consists of an unspecified
number of unstressed syllables (at least one), but occasionally, under certain
conditions, syllables with some slight degree of stress (not equivalent to
that of the onset syllable, and never with pitch-prominence) may occur
there.

The nuclear tail consists of an unspecified number of stressed or
unstressed syllables (at least one of either) following the nuclear syllable,
usually continuing the pitch movement unbrokenly until the end of the
tone unit. In such cases, being wholly conditioned by the nuclear tone,
the tail has no inherent linguistic contrastivity, and only degrees of stress
may be distinguished within it.


We may now make a characterization of a tone unit's maximal internal
structure as being:

Prehead Head Nucleus Tail

the only obligatory element being the nucleus. A tone unit accordingly
may be internally defined as a structure consisting of one of the following
(P(rehead), H(ead), N(ucleus), T(ail)): PHNT, PHN, PNT, PN, HNT, HN, NT, N,
summarizable as (P)(H)N(T), where brackets include optional elements.
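
The formula (P)(H)N(T) can be expanded mechanically. The sketch below, with names chosen here for illustration, enumerates every structure the formula allows and checks candidate strings against it.

```python
import re
from itertools import product

# (P)(H)N(T): each bracketed element is optional, N is obligatory.
PATTERN = re.compile(r"P?H?NT?")


def is_tone_unit(structure: str) -> bool:
    """True if `structure` (e.g. 'PHN') fits the formula (P)(H)N(T)."""
    return PATTERN.fullmatch(structure) is not None


def all_structures() -> list:
    """Expand the formula by switching each optional element on or off."""
    out = []
    for p, h, t in product(("P", ""), ("H", ""), ("T", "")):
        out.append(p + h + "N" + t)
    return out


structures = all_structures()
print(structures)
```

Three optional elements give eight possible structures, from the bare nucleus N to the maximal PHNT; a string such as PH, which lacks a nucleus, is rejected.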
Tone 


Every tone unit contains one and only one nucleus, or peak of prominence,
expounded by one of a finite number of contrasting pitch glides or
sustentions on the accentual syllable of the most prominent word. It has been
called sentence stress, but this is misleading, as the tone unit is seldom
co-extensive with a sentence, or even a clause. Tone may be seen from the 
point of view of its placement within the utterance (tonicity) and its 
directional type. Nuclear tones are divided into three main types: simple, 
compound and complex. 


Simple. Here we include three types of unidirectional pitch movement, 
rising, falling and level, the centre of prominence being at the beginning of 
the glide. If there is a tail the direction of the pitch movement is usually sus- 
tained without change throughout. Any such distinction as that made 
between ‘high’ and ‘low’ varieties of simple tone is not thought of as 
basically a question of tonal selection, however, but as a combination of 
relative height from the pitch-range system plus pitch movement – falling 
tone plus relatively high starting-point or relatively low starting-point 
respectively. The ‘high’/‘low’ distinction is thus primarily a matter of
simple pitch range. The corollary of this is interesting; there are accordingly
as many contrastive types of simple fall, let us say, as there are contrastive
degrees of pitch height: as many as seven. This does not force us to
distinguish systemically seven types of simple fall in English, however. In view
of the fact that the simple pitch-range features may occur elsewhere in the
tone unit than with the nucleus, but with the same contrastive force, and
that the same pitch-range feature produces identical contrasts in different
nuclear tones, it is clearly more economical to take the pitch-range
contrasts separately. Out of seven possible simple pitch-range features, one
is selected, within any tone unit, which determines the beginning-point of
the tone. The ‘meaning’ of the tone, if one might put it crudely, is thus a
combination of the ‘meaning’ of the simple pitch-range feature selected
plus the ‘meaning’ of the nucleus.

Combining falling tone with the simple pitch-range system does not
account for all the falling contrasts found in the data, however. There is a
further independently varying possibility for falling and rising tones, that
of co-occurrence with the complex pitch-range system. If one examines
the width of a nuclear tone in the data, three main types become apparent:
the vast majority of the tones have a fairly consistent width, which we may
call X; the remainder have a width perceptibly narrower or wider than X.
There seems to be only one degree of widening and narrowing with
contrastive force in English. Once again, however, instead of calling
the wide, normal and narrow variants three ‘different’ nuclear tones within
the tone system, it is better, in view of the fact that wide and narrow
contrasts also apply to non-nuclear stretches of utterance, to postulate
complex pitch range as a separate system. The majority of tones co-occur
with the unmarked complex pitch-range feature X, and this may therefore
be called the norm.

1. The simple pitch-range system embraces the differences in pitch between one
syllable and the next. The complex pitch-range system is the width of a tonal movement
(widened, narrowed or monotone).

The same arguments apply to other nuclear tones. Restricting ourselves
to the simple kinetic tones for the moment, we may summarize
the possibilities of occurrence as follows (Ø stands for the unmarked terms
in the simple and complex pitch-range systems):


Beginning-point      Range         Tone
Relatively high      Wide (w)      Falling (`)
Medium               Normal (Ø)    Rising (´)
Relatively low       Narrow (n)
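
The economy argument can be checked by simple counting. Taking the seven simple pitch-range features, the three widths and the two simple kinetic tones mentioned in the text, the sketch below compares the number of terms needed when the three systems are stated separately with the number of ‘different tones’ that would otherwise have to be listed. The feature labels are placeholders invented here.

```python
from itertools import product

heights = [f"height-{i}" for i in range(1, 8)]  # seven simple pitch-range features
widths = ["wide", "normal", "narrow"]           # complex pitch range
tones = ["falling", "rising"]                   # simple kinetic tones

# Every beginning-point/width/tone combination, treated as a separate "tone".
combined = list(product(heights, widths, tones))

# Number of terms needed when the three systems are described independently.
terms_if_separate = len(heights) + len(widths) + len(tones)

print(len(combined))
print(terms_if_separate)
```

Under these assumptions a single undifferentiated inventory would need dozens of entries, whereas three interacting systems need only the sum of their terms.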


These discriminations are made without any reference to the ending-point
of the nuclear glide. There are relatively few contrasts which can be made
using the end-point compared with the range of contrasts elsewhere, partly
because of the formal indeterminacy which exists at the end of a glide – it
is usually immaterial how far a low fall falls, for example, this being largely
a matter of physiological vocal range. It is the relative pitch height of the
whole tone within the tone unit and the width of the tone which are lin- 


on-glide is much less — ^N as opposed och. Secondly, some of thi 
tones emerging from this classification are much more frequent than is
normally supposed. A case in point is what one might call the high-narrow
falling tone, which occurs often in the normal course of serious
conversation. When a speaker is agreeing, usually non-committally (for
example ‘yes’, ‘hm’, ‘really’), to points being made in a discussion, his
response-utterances tend to take this pattern.


Level tone. In English there is often (actually, in about 8 per cent of all
cases) clear evidence of a tone-unit boundary, but no audibly kinetic tone
preceding. In such a circumstance two courses are open: either one may
classify the phenomenon as a further kind of head, or one may call one
of the preceding level tones nuclear. The weight of evidence seems to force
the second solution, for the following six reasons.
1. One of the level tones is always more prominent than the others, and
equivalent to the prominence of kinetic nuclei; it does not seem possible
to reduce this prominence and retain an acceptable tone unit. Also, the
syllable on which it occurs is lengthened substantially, and there is a clear
rhythmical break between what precedes and what follows.
2. This tone nearly always occurs on the last lexical item before the
phonetic boundary, and is thus distributionally similar to kinetic tones.
3. If this tone occurs at the end of a subordinate or correlative grammatical
structure, it admits of replacement by (usually) a rising-type tone. For
example:²
I [readily айміт this| - that Jif you in'flict “corporal 
‘staccato’ püNishment| - with|in the qualifi'cations that Гуе 
‘lento’ defriNED| - ‘it is |going to • re'form • the THUG|’... 
[crimes of tviolence| in |!GkNeral| - |""decreased ` 

3 {мАккеаіу | * be|tween 'eighteen NiNEty | and *|nineteen 
spiky’ — "thirty FOUR|’ 
Level here seems to be functioning as a ‘marked’ form of rise. 
4. In non-subordinate structures the level tone has a range of meanings
(boredom, sarcasm, etc.) quite distinct from the types of meaning carried
by tone-unit heads, and very similar in force to other nuclear semantic
functions.
5. A tone unit with a level tone of the above characteristics is acceptable
to native speakers, and does not sound ‘incomplete’, such as would be the
case if a kinetic tone had simply been dropped. Compare the effect of
stopping after the word ‘time’ in the following pair of utterances: only
the second admits a deliberate pause.

Once upon a time there were three bears

[Two pitch diagrams of this utterance; the contours are not recoverable.]


2. Signs for tones: ` falling; ´ rising; ˇ falling-rising; ¯ level. The first three are
‘kinetic’. The | sign before a syllable indicates stress but with no pitch contrast.


6. Level tone functions in relation to simple pitch range very much like
other nuclei, though not of course to complex pitch range. A case can be
made in fact for seeing level tone as an important gap-filler,
i.e. a difference only in degree from the polarities of both fall and rise.
There remains the question of how far level tones may be grouped with
rises or falls in any general kind of tonal classification. There is
no real reason why they should be grouped with either, and when one
examines their distribution with reference to a number of criteria, no clear
answer suggests itself: level seems to hold an ambiguous status between
the categories of rise and fall, and it would be unwise to force it into either.
Level is clearly functionally rise-like: (a) in subordinate structures, as
already mentioned, especially when preceded by a pitch step-up; (b) when
it occurs as the exponent of the second element of the fall-plus-rise, or
as the maximal degree of narrowing that the rise can take (very common
in public speaking, for example); (c) when seen from the point of view of
its distribution with pause types. Level tones tend to be followed by pauses
of much the same kind and range as rising tones do: rises and levels tend
to have a majority of brief or zero pauses at tone-unit boundaries; falls
generally have longer pauses. On the other hand, level tones seem to be
more fall-like when one considers their distribution in respect of sequences
of tones between unit pauses – in my data approximately 50 per cent of
all falling-type and level tones occurred finally in a sequence, whereas only
25 per cent of all rising-type tones did. There are also various semantic
reasons for not associating some occurrences of level tone with rising
tones, for example the tendency to avoid the meanings of boredom and
sarcasm in subordinate structures (where a rising tone might be used). On
the basis of such evidence there would seem to be two distinct functions of
level nuclear tone, one similar to the function of rising-type tones, the other
similar to falling-type. Any phonological classification of level with either
of these would clearly be artificial, consequently it is listed as a separate
category of tone in this study.


Complex. Here I include all nuclei where there is a change in the direction
of the pitch movement of a kinetic tone within a syllable, and only one
maximum of prominence. (Contrast compound tones below.) The main
categories are the fall-rise and the rise-fall, but both rise-fall-rise and
fall-rise-fall occur. The first element of the fall-rise and rise-fall is
phonetically more prominent than the second, and the second element of the
rise-fall-rise and fall-rise-fall is phonetically more prominent than the third.
The placement of the prominence varies somewhat, and both placements are
found in English. All these tones fall under the patterns of co-occurrence
already described in relation to simple tone, but the exponent of markedness
is different from simple tones, and there are slightly more possibilities for
contrast. In the unmarked fall-rise and rise-fall, it is generally agreed, the
beginning- and end-points are not on the same pitch level; and this relation
is stable regardless of the pitch height at the beginning of the glide. As far
as complex pitch range is concerned, the possibilities are increased by
varying this final element. Thus the whole of the tone may be narrowed, or
with the fall-rise the second element only may be narrowed. Again, the
whole of the tone may be widened, or only one element.


Compound. These tones, also called correlative or binuclear tones, are
combinations of two kinetic elements of different major phonetic types
acting as a single tonal unit. The main types of compound are \+/ and
/+\. The two elements of a complex tone have in effect been separated to
allow a larger stretch of utterance to fall under the semantic range of the
nucleus. It is necessary to review the evidence in favour of taking a sequence
of kinetic elements as a formal and functional unit. For this to be
permissible at least four phonetic and distributional characteristics must be
present.


1. The kinetic tones must display an ‘endocentric’ relationship, i.e.
[tonetic examples illegible] etc., but not [tonetic examples illegible], etc.
‘Exocentric’ sequences of tone units are interpreted as either separate or
subordinate.


2. There must be no evidence of a tone-unit boundary between the tones.
The syllables between the two kinetic elements must display an evenness
of pitch pattern, continuing the pitch movement in a ‘trough’ or sustained
arc from one to the other, for example [transcription illegible]. (This has a
secondary effect of usually making the beginning point of the second element
lower than what would be normal for the beginning of a new tone
unit.) This internuclear stretch is rarely interrupted by a pause, but it may
display some variation in pitch movement, so long as the general tendency is
maintained. Similarly, it is rare to find the internuclear stretch (even when
it is fairly long, say, of six or seven words) interrupted by the introduction
or conclusion of a prosodic or paralinguistic feature, other than one which
reinforces this characteristic. In both types of tone there is a
strong rhythmic unity between the elements, the internuclear syllables
tending to be articulated relatively quickly in each case, and usually
isochronously.


3. One element of the compound tone must be more prominent than the
other, otherwise the analyst will tend to take the sequence as
two separate tone units. The phonetically dominant element is usually the
first. It is possible for strong stress to occur on the second element of a
\+/ but here there tends to be a certain balance of pitch range, the former
then being wider than the second. There is a stronger tendency for strong
stress to occur on the second element of a /+\; not all such cases have
compensating pitch-range characteristics: these must consequently be
taken as exceptional from the point of view of tonicity.


4. Despite the phonetic prominence associated with the first kinetic
element, it is the second which is the major functional element, and the
basis on which the tone is classified. There is no formal confusion between
compound and complex tones: the ‘trough’ characteristic of the former
(contra the gradual rise or fall of syllables in the second element of
ˇ and ^ respectively), its double prominence (contra the single peak of
prominence in the latter), and its inability to weaken or suppress stresses
after the first kinetic element, would seem to suffice to distinguish between
all bidirectional nuclei. While admitting the existence of formal
overlapping between complex and compound tones (i.e. an utterance whose
phonetic shape was ambiguous as to whether it was an instance of one or
the other), in the majority of cases in English it seems possible to identify
a bidirectional nucleus as either compound or complex on phonetic
grounds; the few examples of ambiguity which exist do not justify a
theoretical treatment which takes them as variant forms of the one
category. Moreover, while from the semantic point of view it is well known
that substantial overlap exists, and that this is probably greater than
between any other two tonal types, it would be wrong to postulate any
kind of semantic identity here, and suggest that \+/, let us say, is but a
distributional variant of ˇ.


While there are many cases where there is in
effect ‘tonal synonymy’, for example,

I'm |SORry about the BOOKcase| and I'm sorry about the |BOOKcase|

there are a large number of examples displaying clear semantic contrast
(as with |YOU don't know| (Well, who does, then?) v. |YOU don't KNOW|
(so why are you saying you do!); I |THOUGHT it would rain| v. I |THOUGHT
it would RAIN|), and even where the semantic effect is basically the same
in both cases, its distribution over the lexical items in the tone unit is
so different as to really demand a separate description (for example, the
|MAN said he'd come| v. the man said he'd |COME|). The degree of
linguistic contrast between complex and compound tones thus seems
sufficiently great to justify separate discussion.




Nuclear tail 

Syllables may follow the nucleus, to form a ‘tail’ of pitch movement. Tails 
in English are usually non-distinctive, their pitch-contours being auto- 
matically determined by the direction of the nuclear tone: in other words, 
they are not normally independently variable. But it would be wrong to 
deny any contrastivity at all to them, for occasionally linguistically
significant variation may occur. Excluding level tone, where variation does
not exist (any departure from level pitch movement in the tail being
immediately interpretable as narrowed rising or falling), there are the
following possibilities:
1. Tail continues the direction of the nucleus in an unbroken fall or rise,
namely [tonetic symbols illegible]. This is by far the most frequent pattern.
Stressed syllables in the tail are indicated with ' above and before a
syllable, as illustrated below.
2. Tail begins by continuing nuclear direction, and then levels out. With
falling tones, this may occur for two reasons: either it is an allotone of
type 1, due to one's articulating a fall near the bottom of one's voice-range
and thus being forced to level out, or it is an attitudinally marked
form of tail, communicating such a range of attitudes as irony, sarcasm or
boredom. Stressed syllables in the tail are indicated with – above and
before a syllable. Compare:

She's a |BEAUtiful 'woman| and She's a |BEAUtiful –woman|

[interlinear transcriptions illegible]

With rising tones, flattening is rare. It may occur when one is near the
top of one's voice-range, as with extremely puzzled questions using a
‘high’ rise, for example:

[interlinear transcription illegible]

or when the tone is narrowed, for example:

[interlinear transcription illegible]

Less than 10 per cent of all nuclei have tails with stressed syllables.
This is a low proportion, and the generalization that tonicity
falls on the last lexical item is therefore a most reliable one.


We may now summarize the English tone system in a single table:
Table 1 Summary of English tone systems

                     Simple             Complex                         Compound
Basic types          fall, rise, level  fall-rise, rise-fall            fall-plus-rise, rise-plus-fall
Secondary types      [illegible]        rise-fall-rise, fall-rise-fall  [illegible]
Simple pitch range   [illegible]        [illegible]                     [illegible]
Complex pitch range  [illegible]        [illegible]                     [illegible]

Frequency of occurrence for the basic types is interesting, the proportions
(as percentages) in my data being as follows:

fall    rise    fall-rise  fall-plus-rise  rise-fall  level  rise-plus-fall
(51.2)  (20.8)  (8.5)      (7.7)           (5.2)      (4.9)  (1.7)3
Classification of heads

The head of the tone unit is probably the most complex segment to
describe, and probably least study has been made of it. It is that
independently variable part of the tone unit stretching from and including the
first stressed and usually pitch-prominent syllable (here referred to as the
onset) and extending as far as but not including the nuclear segment. It is
an optional element in the tone unit, but in fact it occurs in an extremely
high proportion of tone units in English – about 70 per cent of the time
in my data. Length of tone unit is closely tied to length of head, and in
this respect (overall length) the head is the most variable element in the
unit: in my data, instances of heads of one to thirty and more syllables
(20+ institutional words) occurred.
The principle of description here is to delineate the contour of pitch
movement over the head, by defining the pitch level of each syllable in
terms of the level of the syllable preceding. The onset syllable is not of
course defined in this way. This syllable is taken as given, being defined in

^ A 
3. Cf. Quirk et al. (1964), where the Proportions аге“ 52.5, 7 24.7, * ^ 9:3, M 69, је: 

7271,7 ** 0-6. Davy (1968) finds a significantly different Него conversa d 

opposed to reading, this is primarily due to the higher proportion of rising-tYP 

in the latter, namely: 

for conversation: ` 58-7, ^ 16:1, 7 8-0, У 7-4, ^7 5:1, 742, ^ +` 0:4; 

and for reading: ` 50:2, 24-6, У 11-1, 7 55, 5+7 5.5,^ 24, ^+\ 0-6. 




a relatively absolute way for each speaker as being the pitch level towards
which one automatically tends to return for the commencement of a new
tone unit, unless a specific attitude requires extra pitch height or depth at
this point to make its effect. Ascertaining the level of onset (or nuclear
tone beginning, where there is no head) for each tone unit is in fact the
only way in which general references to ‘normal’ pitch height of utterance
can be made precise.
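
The relative principle of description just stated can be illustrated with a small sketch. It is an illustration only: the numeric pitch values, the function name `relative_contour` and the `tolerance` parameter are assumptions introduced here, and are not part of the analysis itself.

```python
from typing import List


def relative_contour(pitches: List[float], tolerance: float = 0.0) -> List[str]:
    """Describe each syllable after the onset relative to the one before it.

    pitches[0] stands for the onset syllable, which is taken as given and is
    not itself described. `tolerance` (an assumption of this sketch) is the
    largest pitch difference still counted as 'same'.
    """
    steps = []
    for previous, current in zip(pitches, pitches[1:]):
        difference = current - previous
        if difference > tolerance:
            steps.append("higher")
        elif difference < -tolerance:
            steps.append("lower")
        else:
            steps.append("same")
    return steps


# Hypothetical pitch values (arbitrary units) for a four-syllable head.
print(relative_contour([200.0, 180.0, 180.0, 150.0]))
```

A head whose stressed syllables all come out as "lower" would correspond to the strictly descending series described under type A below; the other types need further conditions that this sketch does not attempt to encode.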

Head patterns in English are classifiable into two major types, falling
and rising, the criterion in each case being how the head begins. The
pattern at the beginning of the head may be reinforced or modified by the
pattern in the middle, and the effect derived from the juxtaposition of these
two will be in turn modified by the pattern at the end. The head is an
extremely flexible segment, making available a wide range of linguistic
contrasts. One cannot reduce all occurrences of head to two or three
‘basic’ types without a great deal of simplification and distortion.

I illustrate the following head patterns using a falling tone as nucleus, 
though of course any other tone would do equally well. 


1. Falling heads. There are four main types, which I have designated
A, B, C and D.

A. This head comprises a descending series of stressed syllables,
with intervening unstressed syllables (these being optional); the stressed
syllables are always lower than the preceding (stressed or unstressed)
syllable. Typical patterns, transcribed in phonetic interlinear and tonetic
transcriptions, are as follows:

[interlinear and tonetic transcriptions of type A heads illegible]

B. This head comprises a descending series of stressed syllables (with or
without intervening unstressed syllables), some or all of which are higher
than, or occasionally at the same pitch as the preceding syllable, but none
of which is higher than the next previous pitch-prominent syllable or
onset. These have been further classified into two types:
B1, heads with the beginning of the nuclear glide marked in pitch range;
B2, heads with the beginning of the nuclear glide not marked in pitch
range.

[interlinear and tonetic transcriptions of type B heads illegible]
C. This head comprises a descending series of stressed and unstressed
syllables, including any of the variations allowed under B, but with the
additions that (a) if there are stressed syllables between onset and nucleus,
the first must be lower than the onset, and (b) the nuclear tone must be
substantially pitch prominent. A subclassification of these can be made,
depending on the direction of the pitch change which causes the
prominence: C1, nucleus higher than the preceding pitch-prominent syllable;
C2, nucleus lower than the expected normal step-down.

[interlinear and tonetic transcriptions of type C heads illegible]


D. This head comprises a sudden drop following the onset syllable, for
which two possibilities exist: D1, the pitch movement continues to fall;
D2, the pitch movement rises at some point.

[interlinear and tonetic transcriptions of type D heads illegible]
2. Rising heads. Two main categories of rising head may be distinguished,
which I shall refer to as E and F.

E. This head comprises a rising series of stressed syllables, with or without
intervening unstressed syllables, each stressed syllable being higher than
or occasionally at the same pitch as the preceding pitch-prominent syllable.
Again, a twofold subclassification seems useful: E1, the beginning of the
nuclear glide is marked in pitch range; E2, the beginning of the nuclear
glide is not marked in pitch range.

[interlinear and tonetic transcriptions of type E heads illegible]

F. This head begins with one or more rising stressed syllables, and then it
ceases to rise. Pitch may then: F1, fall directly to a lower degree of pitch
prominence (usually a booster, but occasionally a continuance or drop),
the nucleus being optionally prominent; F2, fall ultimately (via a series of
lower pitch levels) to unmarked pitch range at the nucleus;
F3, fall as in F1 or F2 before returning to high pitch prominence
(which may then continue as a ‘secondary’ rise).

[interlinear and tonetic transcriptions of type F heads illegible]


3. Falling-rising(-falling) heads. The head begins by falling, as in the first
category described above, then there is a change in the general direction
of the pitch movement, as in the second category; this may then be followed
by a further change in direction, as in the first category, and so on.
There were few instances of this in my data, but examples collected suggest
that a threefold subclassification may not be premature: G1, the first
change in direction (which is the important one) may be [text illegible,
ending ‘continuance’] with pitch-prominent nucleus; G2, it may be
extremely jerky, but still a general rising movement, and end with a
pitch-prominent nucleus; G3, it may be jerky, as G2, but with the nucleus not
pitch prominent.

[interlinear and tonetic transcriptions of type G heads illegible]

4. Rising-falling(-rising) heads. This is the reverse of category 3. The head
begins by rising, then there is a change in the general direction of pitch
movement, which may be followed by a further change, and so on,
theoretically continuing indefinitely. The nucleus may or may not be
pitch prominent, but there were insufficient examples to justify even a
tentative subclassification here. This pattern is referred to as type H.

[interlinear and tonetic transcriptions of type H heads illegible]


The normal distribution of head patterns in English is much more
complex than is normally allowed for, and the number of ‘basic types’ of
head is also more than is usually pointed out. The norm as defined by
Kingdon (1958, p. 3) is: ‘An analysis of English intonation shows that
this consists basically of a slowly descending series of level tones usually
starting at or near the top of the normal voice range and finishing at or
near the bottom.’ However, if one examines the head types in order of
frequency, it is clear that the majority do not display a gradual descending
series in any sense. The only heads which are gradual and descending in a
strict sense are grouped under A; but against these one has to weigh
the rising heads (E and F), the ‘complex’ heads (G and H) and the large
number of heads which display incidental rises and drops throughout
their length (B, C and D). A is certainly the most frequently occurring
category of head (about 30 per cent of all occurrences in my data), but
this makes it a norm only in a very weak sense.
The reasons for this misemphasis on A-type heads would seem to be the
lack of study of spontaneous connected speech in the earlier period of
intonation analysis. The ‘ideal’ stepping head may well be characteristic
of some kinds of written English being read aloud, or of ‘set’ examples in
a pedagogical context, but it is not common outside such contexts.
More prominence and precision should be given to the notion of
‘accidental rise’, i.e. the pitch-prominent marked features of simple
syllabic pitch range which may occur throughout the length of a head and
which, incidentally, are not ‘accidental’ in any sense, nor do they necessarily
rise. Their function seems twofold: to spread relative prominence over the
words in the head, and to add prosodic variety to connected speech. They
occur in about 80 per cent of all heads.

Finally, there is the question as to whether certain types of head tend
to co-occur with specific nuclear tones. There is only one major tendency,
namely, that the higher the head ends, the less likely it is for the
direction of the nucleus to be rising, and vice versa. For example,
[head type illegible], ending in either a high or extra-high booster,
co-occurs most often with [tones illegible], and least often with
[tones illegible]; G3, where the nucleus begins low, co-occurs with
rising heads frequently. Apart from this, the distribution of head types in
respect of categories of nuclear tone shows no significant pattern.


Preheads 
The prehead of a tone unit comprises all syllables before the onset syllable. 
These are normally very few: in my data, the maximum number was five 
(four words). All such syllables are unstressed; the only occasion when a
degree of stress is perceivable is when a noun, verb, adjective or adverb is
brought into the prehead: it is then pronounced with slight ‘inherent’ stress,
so that it is louder than the surrounding unstressed syllables, but it
remains on the same pitch level as these, i.e. it is not pitch prominent.
The possibilities for pitch contrast are very limited: four areas of pitch
height can be clearly identified and there is some evidence for distinguishing
a fifth. The norm of prehead pitch is a level a little below that of the onset
syllable, for example:

We should |LIKE to|

[interlinear transcription illegible]

The unstressed syllables may be level or may rise gradually towards the
pitch level of the beginning of the onset syllable. This level is unmarked in
transcription.
The remaining marked levels are as follows:
1. high prehead (for example "the): the unstressed syllables are perceivably
higher than the onset syllable, for example:

[interlinear transcription illegible]

2. extra-high prehead (for example =the): the unstressed syllables are very
much higher than the onset, usually near the top of the voice-range, for
example:

[interlinear transcription illegible]

[text illegible] tendency for the first syllable of the [text illegible]
as the main auditory cue to accent in this [text illegible]. In the case of
two unstressed syllables, one is [text illegible] and the other (usually the
second) is high, for example: this is the |THIRD| [text illegible]

3. mid prehead (for example .the): the unstressed syllables [text illegible]
being at the same pitch level as the onset; relative prominence is thus based
mainly on loudness, for example:

[interlinear transcription illegible]

4. The remaining possibility is for an extra-low prehead (for example
-the), where the unstressed syllables are below normal low level, for
example:

[interlinear transcription illegible]

Here the pitch of the prehead is very often lower than that of the end-point
of the tone unit.

The grammatical structure of the prehead is largely predictable: all
items which occur there are nearly always from the class of ‘grammatical
words’ though non-grammatical items of certain types may be found. The
grammatical constitution of the prehead is in fact restricted to combinations
(subject to normal grammatical rules of order, which do not concern
us here) of conjunctions, prepositions, pronouns, determiners and
auxiliary verbs, with the occasional interpolation of an introductory
adverbial, a parenthetic verbal group (such as ‘I think’), or a part of a
nominal group (‘only’, ‘Mister’).


Inter-tone-unit relations

Tone units do not exist in isolation, but work in sequences in connected
speech. As soon as tone units begin to be juxtaposed, one has to face
the question of what might be called ‘tonal collocation’, i.e. the extent
to which the formal co-occurrence of tones displays predictable
restrictions.

Most scholars have approached the question of tone-unit sequence
from a wholly grammatical angle, first defining a grammatical structure
and then proceeding to an examination of the ways in which this structure
carries a restricted range of intonation patterns. Clearly there are two sides
to the problem, the grammatical and the phonological, but the latter point
of view has been almost completely ignored. What are the recurrent
patterns of tone-unit sequence in connected speech? Until one has
some answer to this question phonologically, one lacks the ability to
assess the degree of prosodic ‘uniqueness’ of any given grammatical
structure, and thus loses a great deal of the predictive power of the
descriptive statements about co-occurrence which might be made. Phonological
sequences of a fairly restricted type do exist independently of grammar,
both within major grammatical structures and, less frequently, between
structures, and one ought to be aware of the more important tendencies at
work here before embarking upon any process of grammatical integration.


Sequence of tone units 

To discover statistical preferences for the use of specific phonological
sequences of tone units, it is first necessary to determine how to define
the unit sequences to be compared. There are two possible approaches: one
may arbitrarily take sequences in a fixed number (say in pairs, or threes),
seeing what recurrent patterns exist, or one may utilize phonological
features other than pitch as boundary markers, such as pause, or other
prosodic features. Both these approaches are explored below.

The obvious place to begin was to take tone units arbitrarily in pairs
(‘di-sequences’), to see whether there were any significant tendencies. In
view of the fact that quite a large proportion of the tone units in the data
consisted of nucleus (+ tail) alone, attention was focused on the nuclear
segment in the first instance, while not ruling out the possibility of other
segments in larger units also having a recurrent sequential pattern. The
question was asked: given a nuclear tone of type X, what is the probability
of nuclear tone of type Y being the obligatory element in the next following
tone unit? In other words, I was examining progressive influence of tone
units on each other, not regressive (which does not seem to be linguistically
relevant). To ensure total coverage, pairs of tone units (some 5000 in all)
were studied in overlapping sequences, for example:

A-B
B-C
C-D, etc.

Final tone units in utterances (i.e. preceding a change in speaker or
[text illegible]) were not taken as being influential in the same sense as
within utterances. The relationship between intonation patterns at the end of
one utterance and at the beginning of the next would seem to pose problems of
a quite different order from those discussed here.
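
The overlapping-pair procedure can be sketched in a few lines. This is a minimal illustration under stated assumptions: tones are labelled by name, the data are invented, and the function names `di_sequences` and `progressive_probability` are hypothetical. The relative frequency computed here is a plain conditional proportion, which is not necessarily the probability measure reported in the study.

```python
from collections import Counter
from typing import Dict, List, Tuple


def di_sequences(utterances: List[List[str]]) -> Counter:
    """Count overlapping pairs (A-B, B-C, C-D, ...) of nuclear tones.

    Pairs are formed only inside an utterance, so the final tone unit of one
    utterance is never paired with the first unit of the next.
    """
    counts: Counter = Counter()
    for tones in utterances:
        counts.update(zip(tones, tones[1:]))
    return counts


def progressive_probability(counts: Counter) -> Dict[Tuple[str, str], float]:
    """Relative frequency of tone Y following tone X, for each pair (X, Y)."""
    totals: Counter = Counter()
    for (first, _second), n in counts.items():
        totals[first] += n
    return {pair: n / totals[pair[0]] for pair, n in counts.items()}


# Hypothetical data: two utterances, each a list of nuclear tones.
data = [["rise", "rise", "fall"], ["fall-rise", "fall"]]
pairs = di_sequences(data)
print(pairs)
print(progressive_probability(pairs))
```

Keeping each utterance as its own list is the design choice that excludes pairs spanning a change of speaker.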
The general totals of frequency of co-occurrence were subjected to
analysis using the χ² test to assess degrees of statistical significance. This
was to obtain some indication of the gradation of influence of tones upon
each other by quantifying degrees of probability, so that one could say
that X is more or less related to Y than to A, B, C, ... [text illegible] ^
and /+\ and ˇ and \+/ were grouped together, to [text illegible]
of occurrence. The fall (`) as first element in a pair was omitted, in view
of its general frequency: as it occurred in one out of every two tone units,
statistical tendencies for this category would have been [text illegible]
in a corpus of this size.
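
For readers unfamiliar with the χ² test, the statistic itself can be sketched as follows. The counts are invented for illustration, the function name `chi_square` is hypothetical, and the sketch assumes every row and column total is non-zero; it computes only the test statistic, not a significance level, and makes no claim about how the figures in the study were derived from it.

```python
from typing import List


def chi_square(observed: List[List[float]]) -> float:
    """Pearson chi-square statistic for a table of co-occurrence counts.

    Rows stand for the first tone of a pair, columns for the second.
    Expected counts assume the two tones are chosen independently.
    """
    row_totals = [sum(row) for row in observed]
    column_totals = [sum(column) for column in zip(*observed)]
    grand_total = sum(row_totals)
    statistic = 0.0
    for i, row in enumerate(observed):
        for j, count in enumerate(row):
            expected = row_totals[i] * column_totals[j] / grand_total
            statistic += (count - expected) ** 2 / expected
    return statistic


# Hypothetical counts, not taken from the corpus.
table = [[10, 20], [20, 10]]
print(round(chi_square(table), 3))
```

A table in which the second tone is distributed identically after every first tone gives a statistic of zero; the larger the value, the stronger the evidence that the first tone influences the second.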


[text illegible] in my data in terms of a gradation of decreasing probability.

Table 2 Gradation of progressive influence of tone upon tone in the data

Tone 1 influences Tone 2    Probability (of Tone 2 occurring
                            as opposed to any other tone)

[The body of the table is largely illegible. The surviving probability
values are 83.3 and 79.5 for the first two pairs and, further down, 40.7,
34.4, 27.1, 26.7, 25.3, 17.8, 16.2 and 5.7; the tone symbols are not
recoverable.]

Here the importance of ‘tonal reduplication’, the extent to which tones
repeat themselves in sequence, emerges very clearly: it is obvious that the
main basis for any notion of tone-unit cohesion lies in the repetition
of tones of the same category, and not in a combination of tones of
different categories. In particular, the dominance of the
[tonetic symbols illegible] sequence explains why this pattern has been given
such frequent mention in previous discussion of intonation; moreover, it
occurs over a wide range of grammatical contexts.

The nature of the decrease in probability of co-occurrence as one moves
down the table is also illuminating, as it reflects the limitations of ‘tonal
cohesion’ in English as a whole, by suggesting where the least important
areas of mutual influence lie. There is a significant gap between the
probabilities of pairs no. 12 and 13: above this point all probabilities are
greater than 50 per cent; below this point there is a sharp drop to very low
values indeed. It is also significant that below this point the only tones to
occur as first element in a sequence are rise-falls and levels: we may thus
conclude that these tones exert least influence on following tones of any
other kind, and that the linguistically more interesting tones that exert a
general influence on anything which follows are the rise and fall-rise.

An extension of this method to longer sequences of tone units is of
doubtful value because of the limitations of the corpus and the lack of
grammatical perspective. As far as the first problem is concerned, it is
clear that the longer the sequence, the more data one needs to obtain an
adequate sample of all but the most frequently occurring tones. Even if
one restricts one's attention to tri-sequences of tone units, one finds that
the possible number of sequences is increased from 49 to 343 (7 x 7 x 7),
and the scatter of sequences which occur less than ten times is substantial:
in fact the only frequent tri-sequences are combinations of simple rise and
fall, as one might expect – [tonetic sequences illegible]
(the latter being most frequent of all). There are
certain interesting sequences of complex and compound tones, but these
appear only as minor tendencies, and in fact it becomes simpler to list
them than to classify them. Such patterns are in any case intuitively obvious.
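
The growth from 49 to 343 possible sequences follows directly from there being seven basic tone types, and can be checked by enumeration. In this sketch the tone labels are descriptive names chosen here for readability, and the function name `possible_sequences` is hypothetical.

```python
from itertools import product

# The seven basic tone types, labelled by name for readability.
TONES = ["fall", "rise", "level", "fall-rise", "rise-fall",
         "fall-plus-rise", "rise-plus-fall"]


def possible_sequences(length: int) -> int:
    """Number of distinct tone sequences of the given length."""
    return sum(1 for _ in product(TONES, repeat=length))


print(possible_sequences(2))  # di-sequences: 7 x 7 = 49
print(possible_sequences(3))  # tri-sequences: 7 x 7 x 7 = 343
```

Each additional tone unit multiplies the number of possible sequences by seven, which is why a corpus adequate for pairs becomes thin for triples.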

Apart from this practical reason, the absence of any non-reduplicative
sequences of three or more units' length in the present data is another
reason for suggesting that the di-sequence approach is the most useful in
this field. We may set up a hypothesis on this basis, therefore, that
tones tend to [text illegible] units, and that while TU1 has a
[text illegible] influence on TU2, it has none on TU3; TU2 influences the
choice of TU3 but not TU4; and so on. It is hoped on some future occasion to
establish informant reaction techniques which would throw some further
light on this matter. It is clearly an issue of central linguistic importance,
relating as it does to the nature of the creative process in the production
of language.

The second approach, that of using phonological features other than
pitch to delimit sequences of units, is not at all helpful; for
one thing there are so few of them. But the main reason for the inability
of the analyst to extract useful information from phonologically defined
longer sequences of tone units is that he has no guarantee that the sequences
within each inter-pausal stretch, let us say, are in fact linguistically
comparable. Between pauses there is a diversity of grammatical structures
and there is no way of knowing whether any occurrence of a tone is being
used in a relatively abnormal way or not. Phonological analysis of tone
sequences of any length is clearly of little value without a grammatical
frame of reference of some kind. Without some means of grammatical
delimitation and definition for longer sequences, it becomes
impossible to draw any linguistic conclusions from the statistics. The longer
the sequence the more one requires grammatical clarification of the
structures ‘carrying’ the tone units. Otherwise, one is in danger of taking
sequences as identical on the basis of a phonological surface structure,
when in fact they perform totally different functions; for example, in the
inter-pausal example, [tone sequence illegible] could be either two
subordinate clauses plus a main clause, or a series of three sentences rapidly
delivered, to take just two possibilities. At such a point analysis made
without reference to grammar becomes artificial. Once the grammatical
influence on longer sequences has been defined, however, it is then possible
to go back and make a formal study of the non-grammatical residue of
sequential information from a wholly phonological standpoint. A great deal
more data than that used here would be necessary for this to be successful,
and the only realistic way of approaching the problem would be via a computer.
Further study of sequence of tones, therefore, or of other sequential parts of
tone unit structure, should not be made without reference to grammar.


Theory of subordination 


The theory of subordination presented here is essentially that outlined
in Crystal and Quirk (1964). The primary characteristic of the subordinate
tone unit is that its pitch contour, while having a complete and independent
shape within itself, falls broadly within the total contour presented in the
superordinate tone unit. It may precede (‘preposed’ subordination) or
follow (‘postposed’ subordination) the superordinate nucleus, singly or
in combination with other subordinate units having the same kind of
systematization. To determine whether one of the two neighbouring tone
units is superordinate (TU1) or subordinate (TU2), the following
criteria are used:


1. The nuclear type postulated as subordinate must repeat the direction of
the nucleus in TU1, both nuclei being one of the two primary categories,
fall or rise. If this direction is not similar, subordination is not possible,
and the tone units must be treated as independent. Complex tones require
a treatment based on their potential relationship with one or other of the
two main categories (see below).

2. The width of nuclear movement in TU1 must be greater than that in
TU2. The range disparity between the nuclear tones is the main factor in
determining the subordinate partner, degree of stress being secondary.
The types of subordination reviewed below are based on the kind and
degree of this disparity, which is perceived by comparing the starting
points of the kinetic tones in TU1 and TU2. There are three main
possibilities: either (a) TU2 will start and finish completely outside the
range of TU1; or (b) there will be an overlap; or (c) TU2 will fall completely
within the range of TU1. It may well be necessary to take the latter two
categories together, as little contrast seems to exist between them: they
are closely correlatable in form and function, and the main contrast is
undoubtedly between these and (a) above.
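
The two criteria and the three range possibilities can be expressed as a small decision procedure. This is a sketch under explicit assumptions: each nucleus is reduced to a numeric start and end pitch (arbitrary units), only simple falls and rises are handled (level and complex tones are outside its scope), and the names `Nucleus` and `subordination_type` are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Nucleus:
    start: float  # pitch at the beginning of the kinetic tone
    end: float    # pitch at the end of the kinetic tone

    @property
    def direction(self) -> str:
        return "rise" if self.end > self.start else "fall"

    @property
    def width(self) -> float:
        return abs(self.end - self.start)


def subordination_type(tu1: Nucleus, tu2: Nucleus) -> str:
    """Apply criteria 1 and 2, then sort TU2 into (a), (b) or (c)."""
    if tu1.direction != tu2.direction:
        return "not subordinate"      # criterion 1: direction must repeat
    if tu1.width <= tu2.width:
        return "not subordinate"      # criterion 2: TU1 must be wider
    low1, high1 = sorted((tu1.start, tu1.end))
    low2, high2 = sorted((tu2.start, tu2.end))
    if high2 <= low1 or low2 >= high1:
        return "a: outside the range of TU1"
    if low1 <= low2 and high2 <= high1:
        return "c: within the range of TU1"
    return "b: overlap"


# Hypothetical falls: a wide superordinate fall and a narrower one inside it.
print(subordination_type(Nucleus(200, 100), Nucleus(180, 120)))
```

Since the text suggests that (b) and (c) may need to be taken together, a caller could reasonably merge those two results into a single category.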

It is usual to find a correlation between an increase in pitch width and an
increase in loudness, though this does not affect the decision as to the type
of subordination involved. In my corpus TU1 was regularly more prominent
than TU2, most of the difference being due to pitch increase or a
combination of pitch increase plus stress. There were a few cases (about
10 per cent) where a subordinate unit (diagnosed by pitch width) had more
prominence than the superordinate unit (though this was rarely due to
strong stress or high booster). But since it is very much more usual to find
subordination corresponding to reduced pitch width, reduced pitch range
and reduced loudness, it seems reasonable to make the last two of these
diagnostic in general, and particularly in the case of level nuclei, where
pitch width is by definition inapplicable (see below).

A very similar situation existed for preposed subordination. Here, there
was a greater variety of co-occurrences with pitch-range features within
the subordinate unit (presumably because [...] head), but the same
tendencies appeared here as above: TU1 was more prominent than TU2 in
about 60 per cent of cases [...].

Subordination would thus seem to [...] certain 'favourite' configurations
of pitch/stress [...] that if the nucleus does not fall on the last lexical
word of a phrase, and if the nucleus is pitch prominent in simple pitch
range (particularly if it co-occurs with an onset or high booster) or if it
[...], it is highly probable that the final lexical item [...]. It is rare
for the subordinate nucleus itself to fall on [...] item, or for other
lexical items to occur between subordinate and superordinate nuclei – 70
per cent of all postposed subordinate units had the nucleus occurring [...]
position and any intervening utterance [...] a prehead (in about 2[...] per
cent of cases) and/or a tail (in about 10 per cent of cases).

The [...] of the system of subordination may be represented in three stages:


1. The width of a TU2 unit (or units) S TUI ps WEST 
Tange in relation to the ‘middle’ ran ie iti BY ee. р 
Luma. p of the position Oe. \ 
ОСЕ бш у tira 
rdinate fall, > ; 
Ог falls within TUI, or falls outside or 
isolate eight categories of rise- А D 


[Diagram: the numbered rise and fall patterns of subordination, shown in
the interlinear (e = 'extra-high', h = 'high', l = 'low' beginning point
of the subordinate tone)]

3. But this is not yet the complete system, because each of the possibilities
outlined in (2) has an alternative form with narrow pitch width. This
narrowness may theoretically occur in all parts of the system, and results
in a system of thirty-two types of subordination to distinguish all the
variables within the rise and fall categories. In practice, however, not all
of these numerous possibilities are realized with equal frequency. Some,
indeed, are highly unlikely ever to occur, for example:

[Interlinear diagrams of the unlikely patterns]


The most frequently occurring patterns in my data were those which 
approximate most nearly to a nuclear tail, namely: 


[Interlinear diagrams of the most frequent patterns]

but the following three patterns were also common:

[Interlinear diagrams of three further patterns]


But whereas a tail has little prominence and its only pitch movement is
to follow directly that of the nucleus, the subordinate tone unit has a
pitch contour which always results in increased overall prominence as
compared with a tail, and hence in a clearly different significance.
At the most general level, for example, we may interpret the utterance
I |TOLD you [I didn't |WANT to]| as carrying more information than
I |TOLD you I 'didn't 'want to|, though the nature of this extra information
remains to be defined. To illustrate this point further, we might consider
how|EVer [this may |BE]|, in the sense 'whatever the case may be', which
is different in meaning from how|EVer| this may |BE|, where the second
tone is equivalent in width to (or wider than) the first and where there
may be a longer pause following 'however': this has the effect of making
the utterance an independent sentence, with the paraphrase 'However
(well), this may be the case'.

There is one notational modification in the examples: the simple pitch-range
symbols e, l and h relate the starting point of the subordinate
nucleus to that of the superordinate nucleus, not to the pitch of the
preceding syllable or segment-initiator. Thus in: it |wouldn't be [hANy]
[...] the h relates 'any' to 'use', and the [...] relates 'any' to 'wouldn't'.

The basic patterns may be set out as follows (all examples being taken
from the data):

A. Simple subordination, i.e. one TU2 either preceding or following TU1:

[in [reis "сова У 


th 
ere aren't мАпу ‘murderers lexecuted 
ves 


D LJ 
thi ~. • И ps = 
his is ect. [the Jet Ae) Губки) twine which [ARE] TÉRMed] 
ai 
Ke 
EE 
B. Complex subordination, i.e. more than one TU2 either preceding or
following the TU1.
of which there are two 


ће, ; 

GC is particularly typical of S 

ordina when the first subordinated H, ich is mi 
„nated to it in immediate seguen ut for example: 


a 


it were a fresh T U2 related (0 a 
ou've sugictstedlll 


this ; 
33 is not o!nricatory mee? 
Zar 


arked accordingly 


М Оу 
inated to TU3 in like 


It is 
А " 4) 5! 
Possible to have a fourth unit ау to exist. (b) У о 
a third) accom- 


Sübora: 
Srdinated unit (T U2) has а sell o is form 
whic! A FG to TU2 in relation 


Dying ; 
in: 9B it in immediate sequence jons SiM 
ў functions $ ordinate subordination’ MU 


Ditch 
jot Lange and start and W to 
oe Such a sequence may wier, 
*ferred to as T U22, for examp e: 


{ 


ууф о: 


d TU2s, ` 
Den 102) has another unit sub- | 


Where the first. 


anner t 
but longer sequences С0 no nit (am 
unit PI equivalent to TU2 


A 


In prison]| 


[n3 


A TU22 may of course have a TU3 functioning in subordination to it,
for example:


e.g.: I believe [in de|tention Inckntres] – *[or |GLAss 'houses 


Е омо wu e ~ SS x 


“fas they were |cAtted in the 'army]? 2 «оу narrow creak’ 


"Een. У e ө Ө е 


“allegro” 


Only the first two of these patterns (TU1 + TU2 + TU3, TU1 + TU2 
+ TU22) are at all frequent. 


2. Complex subordination is rare in positions before TU1, there being
but three instances of this in the data; one example is as follows:


lonly a . ә: а гіу мбмтнзј |[Arrer . о: [ t wktcoming]]. with 


9 ee 25 en ONE e s o 
(реши пат Se пиши па E Coo REV ИН. ЛЬНА 
en" 1trRüsiasm [of[ricially]] 

cec rum eee) 

nw к E 


—————————— 


C. Compound subordination, i.e. one TU2 preceding and another following
the TU1, as already illustrated in the [...]. There were few instances of
this in the data [...].

D. Compound complex subordination, i.e. complex subordination both
preceding and following TU1 (which is highly unlikely), or complex
subordination either before or after TU1 with simple subordination in the
alternative position. There is one such case in the data, of the following
[...]

Finally it is necessary to cater for the occasional complex tone (usually
only a fall-rise or rise-fall) which can enter into subordinate structures.
These nuclei (or the corresponding compound types) may occur as TU1,
in which case they must be repeated in TU2 in narrowed form, or
followed by the relevant primary category (^ with У, and ` with ^), to allow
subordination to take place at all. The latter is by far the most frequent. In
preposed subordination, a narrowed complex tone of the same type as
TU1 may occur, or a simple tone which is of the same initial category as
the phonetically most dominant element of the complex (or compound)
tone, for example we may have the exocentric ^ + ^, or `+ У, and not the
endocentric ~ + ^, or ^ + У. The criterion of pitch width is still the deciding
factor, though modified: the kinetic tone as a whole (and not merely its
starting-point) is now considered in relation to TU1. Thus en is
subordinate to e^ and would in turn subordinate e . If this does not
apply, it is still possible to infer subordination using the secondary criteria
of reduced prominence (cf. above) and a lower or higher pitch-range
outside the range of TU1. This would necessarily apply to level tones where,
as we have seen, pitch-width is not applicable. For example:


S let ne он аеро У 
Ok И о с ~ о A 


Ato reo rendum of L tux] een 


GE oe. = ea Q^ C^ 
the ар Аред [ma|chinery in the coURTS]| 
the ap EA [ma]chinery in the COURTS 


920: em Те кеца "ers 


Sao) 9— J нат и киш се 
Levels may of course occur as TU2s, where the superordinate units have
kinetic nuclei; there the narrowness of nuclear movement in TU2 has merely
been taken to extremes, for example:


“2? Ithe [Home 4stcretaryil 


Буле ө- ә je 
по о | а о —— 


On the whole, there is no difficulty in deciding whether a sequence of
tones is subordination or some other prosodic phenomenon, such as a
sequence of unrelated units or a compound tone.


References

CRYSTAL, D., and QUIRK, R. (1964), Systems of Prosodic and Paralinguistic
Features in English, Mouton.

DAVY, D. (1968), 'A study of intonation and analogous features as exponents
of stylistic variation, with special reference to a comparison of conversation
with written English read aloud', University of London M.A. Thesis.

KINGDON, R. (1958), The Groundwork of English Intonation, Longman.

QUIRK, R., DUCKWORTH, A. P., SVARTVIK, J., RUSIECKI, J. P. L., and
COLIN, A. J. T. (1964), 'Studies in the correspondence of prosodic to grammatical
features in English', in Proceedings of the Ninth International Congress of Linguists,
Cambridge, Massachusetts, 1962, Mouton.

QUIRK, R., and CRYSTAL, D. (1966), 'On scales of contrast in English connected
speech', in In Memory of J. R. Firth, edited by C. E. Bazell et al., Longman.


7 Dwight Bolinger 


Relative Height 


Dwight Bolinger, 'Relative height', from Prosodic Feature Analysis, edited by
Pierre R. Léon, Georges Faure and André Rigault, Marcel Didier, 1970, pp. 109-25.


Summary 

Not only is pitch a layer that interacts in complex ways with the other
layers of language; it is also layered internally. Four layers (according to
this theory) can be distinguished:

1. A rather highly grammaticized layer, including accents (syllabic
prominence per se), terminals (rise, fall), and levels (parentheses, paragraph
and other discourse divisions).

2. A partially grammaticized layer, covering the behavior of accented
syllables in relation to reference points (which may include other accents).
This is the layer of 'controlled' affective meanings: the speaker conveys
his attitudes and along with them the information that they are part of his
message.

3. An ostensibly ungrammaticized layer, that of the behavior of unaccented
syllables. Here are the 'uncontrolled' affective meanings: the speaker
conveys his attitudes and along with them the information that they
intrude on his message. But this of course is a message too – anything can be
faked. Hence the 'ostensibly'.

4. A genuinely ungrammaticized layer, that of levels dictated by emotion:
wide or narrow range, extra-high pitch, etc. These can be faked but are
less likely to be.

The paper concentrates on the second layer and the third.


Intonation cannot be analysed in a simple linear way. This is obvious from
certain types of embeddings in which the whole intonational scale is
reduced for a given stretch of speech, forcing that stretch to be interpreted
in contrast to the whole utterance. The clearest case is the parenthesis.
Within it we find normal syllable-by-syllable contrasts – a particular rise
in pitch may stand out and be interpreted as signaling importance within
the parenthesis – but the importance of the entire parenthesis is signaled
low in the utterance as a whole. Example:

[Pitch diagram: Sometimes, when I have time to ponder on such things,
I wonder whether any of these efforts are really worth while. The when
clause runs at a lowered level.]


(Pitch of whether slightly higher than things). If the when clause is put first,
it has the same internal shape but comes up to the average level.

Parenthesis is not the only example of wheels within wheels. Another
fairly obvious one might be called paragraph intonation. In a series of
sentences each of which ends in a low pitch, one usually detects an overall
lowering at the end, signifying the closing of a particular topic of
discourse. If we can say that a downward movement of pitch signifies
finality, then this represents finality imposed on finality. As with other
kinds of embedding, there is probably no logical limit: one could have a
complete-narrative intonation superimposed on a paragraph intonation
superimposed on a sentence intonation. But there are practical limits.
Intonational range is used up rather fast.

The third example of layering is the theme of this paper.¹ A pitch


1. The contrasts in pitch dealt with in this paper are not the ones that produce a pitch
accent on a given syllable – abrupt, but not necessarily wide, departures from a
reference line. Rather they are relationships of accented and unaccented syllables on a large
scale. For example, if we are in a wallpaper store and pronounce this sentence

[Pitch diagram: I need a single roll, rising to single, with roll low]

the dealer will understand us to mean just one roll; if we say

[Pitch diagram: I need a single roll, rising to sin-, with -gle roll low]

[...]

[Pitch diagrams: two further contours of I need a single roll]

in which the accentual contrasts are the same but there is an affective difference.
Accentual differences are all-or-none. Affective, intonational differences are gradient.


movement like the one in figure 2 can be described as a rise-fall. In it there
is a syllable (or possibly two or more) that is higher than its surroundings,
and the height relative to that immediate environment we can suppose to
have some kind of effect. In figure 3 there are two such rise-falls standing
close together, and now we must ask how they influence each other. I
propose that the most obvious effect is related to the tangent that is drawn
from one peak to the next, and that there are differences between rising
and falling tangents:

[Figures 2 and 3: a single rise-fall, and two rise-falls joined by a tangent]
For example, taking sentence number 4,

4. I'd get his consent if I were you.

and putting it on 3 with a rising tangent, the consequences are not the
same as putting it on a falling tangent. One thing we'll need to consider is
the relationship between this tangent and a continuous pitch movement
connecting the peaks, in this case giving 5 and 6 with rising and falling
pitch respectively:

5. I'd get his consent if I were you. [pitch diagram: continuous rise]
6. I'd get his consent if I were you. [pitch diagram: continuous fall]
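
The idea of a tangent drawn from one peak to the next can be made concrete with a rough sketch. It rests on assumptions that are not in the text: each syllable is given a single numeric pitch value on an arbitrary scale, a peak is any syllable higher than both its neighbours, and the function names are invented for illustration.

```python
def find_peaks(pitches):
    """Return indices of syllables higher than both immediate neighbours."""
    return [
        i
        for i in range(1, len(pitches) - 1)
        if pitches[i] > pitches[i - 1] and pitches[i] > pitches[i + 1]
    ]


def tangent_direction(pitches):
    """Compare the first and last peaks of a contour.

    Returns 'rising', 'falling' or 'level', or None when the contour
    has fewer than two peaks and so no tangent can be drawn.
    """
    peaks = find_peaks(pitches)
    if len(peaks) < 2:
        return None
    first, last = pitches[peaks[0]], pitches[peaks[-1]]
    if last > first:
        return "rising"
    if last < first:
        return "falling"
    return "level"
```

On this simplified view, a contour such as `[1, 3, 1, 4, 1]` has a rising tangent and `[1, 4, 1, 3, 1]` a falling one, while a single rise-fall like `[1, 2, 3, 2, 1]` has no tangent at all. The sketch says nothing about the meanings attached to each direction, which are the subject of the discussion that follows.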


My first examples are with commands, which – when they contain at
least two accents – regularly fall from the first to the last,² and contrast

2. Ambiguities may arise between the normal form of commands and that of
question-answers if there are not enough syllables to give full display. Thus Beat it! and Get
going are synonymous but the first has one accented syllable, the second two. The first,
in the shape

[Pitch diagram: Beat it, falling]

serves equally as a command delivered out of the blue, and as an answer to What would
you do if he threatened you? The second shows a contrast, for these two situations, as
follows:

[Pitch diagrams: Get going, two contours]

with question-answers,³ where there is a rise. So the command


7. Get a haircut. [pitch diagram: falling from Get to hair-]

differs from the question-answer – which might come in response to the
question What do I have to do to look respectable? –

8. Get a haircut. [pitch diagram: rising from Get to hair-]

When we try to reverse these we find that the question-answer could be
given the intonation of the command, though – precisely because it would
sound like a command instead of the expected answer to a question – it
would appear rude; but the command could not be given the intonation
of the question-answer – I would not come into the room, look at you, and


3. Intonationally, the class 'statement' is both too broad and too narrow. It is too
narrow because intonation is in the main indifferent to the grammatical form of an
utterance. In the example, Get a haircut, You can get a haircut, Get a haircut would be
my suggestion, etc. are all the same as answers to the question. It is too broad because
intonation is concerned not with statements as statements but with different kinds of
them. There are statements that come as observations having no connection with
discourse, e.g. the remark that one might make to a motorist who carelessly splashes
water with his car,

[Pitch diagram: That wasn't a nice thing to do.]

which is quite different from the answer one would give to an interlocutor's question
Why did you object? –

[Pitch diagram: It wasn't a nice thing to do.]

A non-statement intonationally identical to this would be the petulant child's reply

[Pitch diagram: Because.]


Even the notion of 'question-answer' needs to be taken in a special sense, because
[...]. For example, answering When can [...] from you? it would sound strange
[...] hour – it should [...]. Question-answer [...] to cover answers in which the
speaker has in mind '[...] (will) my hearer wonder about the information I am giving
him? To allay that wonder I will make a stronger appeal for his acceptance.' The
majority of questions [...] the degree of wonder that calls for an overall rise in the
answer.


say out of the blue Get a haircut with the intonation of number 8. We
observe the same incongruity in

9. Let me give you a piece of advice: *Get a haircut. [pitch diagram: Get a
haircut with the contour of number 8]
So within the overall intonation of commands, what happens? I will
use examples with enough syllables to give a good display. I might say

10. Hand me that little penknife of yours. [pitch diagram: two peaks]

Or I might say

11. Hand me that little penknife of yours. [pitch diagram: continuous fall]

Number 10 has two peaks, the first higher than the second; number 11
is more or less continuous downmotion. Both are good commands, but
they differ in their appropriateness to situations. With number 10 you
would not be surprised if I took the penknife and went about my business.
With the second, you would probably expect something more from me –
in the way of words or performance, e.g.

12. Hand me that little penknife of yours. Let's see what happens when I
trim this a bit. [pitch diagram]

This contrast can be ignored for the moment, to consider again the main
point: that the relative height of the two peaks has the same
appropriateness to commanding as the continuous downmotion. It can be contrasted
with the overall rise on a question-answer using much the same wording,
for example answering the question What did he do to help?:


13. He handed me that little penknife of his. [pitch diagram: two peaks]

– with two peaks, the second higher than the first; or, using a continuous
rise,

14. He handed us that little penknife of his. [pitch diagram: continuous rise]

There is a situational difference here too, but the main thing is the overall
rise. The same comparisons, with the same results, can be made with
questions that use a terminal fall. But I don't want to overdo the [...].

Thus far the only terminals used have been falling ones, and the
accents have been relatively high ones. There is also the [...]
with rising terminal and accents that stand out by being lower than the
reference pitch. We ask once again whether the tangent to two such
accents is comparable to continuous motion in the same direction.
Questions make the best examples, as they come most naturally with a
terminal rise. We try two, in which the tangents go in opposite directions,

15. Is something wrong? 16. Is something wrong? [pitch diagrams: inverse
accents, with tangents in opposite directions]

4. Questions of this type usually imply that the speaker already has part of the
information he wants. Imagine that someone is planning to give a party but is not on
good terms with the prospective guests. A friend asks

[Pitch diagrams: Would anybody come? or Would anybody come?]

Listeners would probably interpret the first to mean that the speaker is just doubtful,
and the second to mean that he is pretty sure no one will. If the two questions are asked
with a continuous rise or a continuous fall, there is a difference, but the same
implication of mere doubt as against negative assurance still carries through:

[Pitch diagrams: Would anybody come? or Would anybody come?]

Similarly for interrogative-word questions with two peaks,

[Pitch diagrams: But how could you convince him? or But how could you convince him?]

and with continuous rise or fall:

against two others, with continuous motion in place of the tangents:

17. Is something wrong? 18. Is something wrong? [pitch diagrams: continuous
motion in each direction]

The effect of the overall rise – however you want to describe it, perhaps a
stronger appeal – is the same, and similarly the overall fall.

For a question-answer using the same configurations, we can imagine
a child asking a question and a parent answering it as if to put the child
back in his place. The child asks What's this? The parent answers

19. It's something of mine. 20. It's something of mine. [pitch diagrams]

The upmoving tangent in the second of these pretty clearly warns the
child to mind his own business, despite the pseudo-sweetness of the
inverted accents. Comparing these with continuous movement we get

21. It's something of mine. 22. It's something of mine. [pitch diagrams]

– again the warning stands out in the overall rise. The same comparisons
can be made with commands.⁵
|y commands, intonationally speaking 


eese arc reall 
kage down on the table, and initiate a 


5. But there is a question whether th! 
puta pac! 


ion would not come into а 100?» 
ourse with 


put, 
‘ting this a 
own 


DUE 
Fm , he, Please одус it 
e 


jp 
1 : 
ine he would end his speech with 
lease leave ita ў 
1 
о, 
To, 


If the relationships hold for terminal rises as well as terminal falls, what
about a terminal rise-fall-rise? If someone asks you What shall I get
Maude for her birthday? you might reply

23. She's crazy about candy. or 24. She's crazy about candy. [pitch diagrams:
two peaks]

These compare with

25. She's crazy about candy. or 26. She's crazy about candy. [pitch diagrams:
continuous motion]


(It is instructive to see the intonation of the first part of the speech. It does not answer
a question, but gives information that the hearer is expected to act upon, and has the
intonation of a command. To say

[Pitch diagram: I'm putting this down here.]

would be inappropriate, though it would be normal – with it replacing this – in answer
to What are you doing with that? The authoritarian tone of the two successive falls can
of course be sweetened with gesture to any degree the speaker chooses. On the other
hand, if the utterance came in answer to a question, e.g. What shall I do with this? it
might well take the shape

[Pitch diagram: Just leave it alone.]

and similarly if it doubles for a conditional clause, as the imperative so often does:

[Pitch diagram: I'm putting this down here. Just leave it alone and everything will be
fine.]

'If you will leave it alone, everything will be fine.'


Once more the overall downmotions match, and the overall upmotions.⁶

The examples up to this point have been comparatively uncomplicated. 
They have involved two peaks at most, and the accents have moved in a 
uniform direction. Utterances are not limited to such rudimentary com- 
binations, and may show not only more than two accents but also more 
than a single direction in which their pitches tend. Consider first a Series 
of three peaks with a uniform rising tangent:

27. It was almost too much to bear. [pitch diagram: three peaks, rising tangent]

Next with a falling tangent:

28. It was almost too much to bear. [pitch diagram: three peaks, falling tangent]

These can be compared to the simple rise and simple fall:

29. It was almost too much to bear. [pitch diagram]
6. In the previous examples the upmotion was given first. The last examples have the
downmotion first. To do it the other way would have been to invite [...] between the
two examples, making 23 appear to refer to something [...], as if the question had
been What shall I get Maude for her birthday? [...] right? This illustrates a difficulty
with accents: when they highlight the utterance as a whole, or just the lexical items
that carry them. Either interpretation can be given to either of the two-peaked
utterances, though it would be less usual to have the higher of the two peaks [...]
the lexical item candy that is getting the attention [...]. [...] problem with which this
discussion is concerned [...] affected – [...] overall upmotion maintain their
relationships. A [...] example of utterance accent as against individual item accent
[...] sought among idioms, where commonly the items lack individual significance.
Look here is an idiom that [...] is being called to account – the word here does not
mean 'in this place'. But it may be taken literally too. Either way the same accents
and intonations can be used, and

[Pitch diagrams: Look here, two contours]

contrast with each other in either sense.




With i inverse accents there is a problem when more than two are We 
y à i tangent. Tt is possible though pretty unusual to join them on à 


Гай тог test is w wi h questions, but the results are the same: 
Is there t thing. with 


. Ner theless, Ше: gents and the steady movements can be compat ^ 
" befe ore. Example 33 can 


There are of course more complex possibilities. In principle, it is likely
that either normal or inverse accents can be combined in any order,
though some sequences are unlikely. The commonest type of combination
is the one in which inverse accents at the beginning of an utterance serve
as a foil to high-pitched accents at the end. The tangent to the inverse
accents can point in the same direction as that of the terminal rise, or in
the opposite direction:

[Pitch diagrams 36 and 37: It was ... too ... to bear]

Whereas in number 37 there are two dimensions of contrast with the final
rise, in that the accents themselves go down and the tangent to them also
goes down, while the final accent rises, we also find a single dimension of
contrast with only the tangent in contrast with the final rise:

[Pitch diagram]
Other possibilities include an inverse accent between two normal ones,
but this is unlikely unless the tangent rises:

[Pitch diagrams, including 40: I told him to get the money.]

(Notice that 40 is not the same as 41, where get is unaccented:

41. I told him to get the money. [pitch diagram])

They also include one or more inverse accents before two normal ones;
the tangent to the inverse accents may fall or rise, but again the tangent
to the normal ones generally rises. I combine the diagrams:⁷

[Combined pitch diagram]
It seems unlikely, too, that three successive peaks will have the highest
one in the middle

43. ?I told him to get the money. [pitch diagram: highest peak in the middle]

– unless the middle peak is on a word that is made prominent for its own
sake, such as an intensifier:

44. It was almost too terrible to [...] about. [pitch diagram]

7. It is necessary to confect an example with a good display of syllables with
reduced vowels to show the difference between an unaccented syllable kept at a low
pitch to set off a following accent, and an accented syllable brought to a low pitch as
an inverse accent. Thus looking at

[Pitch diagram]

the appearance is of a succession of falls and rises alternating syllable by syllable [...]


More examples could be added – in particular I could cite pitch directions
that are broken between unaccented syllables:

45. I told him as simply as I could to be careful with money. [pitch diagram]

But these would only complicate the argument and would not, I think,
affect what has been said about the dynamics of interrelated accents.⁸
D 


8. The continued rise (or level) after an accent that is marked by being jumped up to,
as on told, sim-, could, and care- in this example, is the criterion of what I have
elsewhere termed Accent B, in contrast with Accent A, which is jumped down from, as
on mon-. An example of the same sentence with a succession of A accents:

[Pitch diagram: I told him as simply as I could to be careful with money.]

The behavior of the unaccented syllables after a B accent with reference to a following
accent makes a graded, but at the extremes a striking, difference in the attitude
conveyed:

[Pitch diagrams: two sets of three contours, Just never you mind and I don't believe
him]

If I may attempt an exegesis of these three, I will do it just in terms of the height of the
accents relative to each other, since the implications of height above the reference line
[...] height of unaccented syllables is dealt with in the text.

The overall rise carried by the mounting succession of accents in the first of each set
of three intonations represents hearer-orientation. Indulging in what may be fancy but
strikes me as close to truth, I would say that overall rise is a primitive alarm cry. It gives
information that is useful to the tribe. The fact is that verbal alarms are always
expressed this way, and also that – as the examples have repeatedly shown –
question-answers, which are oriented toward the person who asked the question, do the same.
So do most questions, which are appeals to the hearer – they differ in their terminals
sometimes, but not in their tendency to rise.

The overall fall of the third members of the two sets represents speaker-orientation.
Things are settled. The speaker is assured, or speaking for his own benefit, or
dominating someone. This is the regular intonation for commands. In the examples, Just


The best inference I can draw from the evidence I've given is that there
is an intonational layer consisting of accented syllables and their relation
to reference levels and to one another, which is independent of the changes
in pitch that occur elsewhere. There is nothing remarkable in this, because
it has been the custom to pretend that unaccented syllables did not matter,
that their behavior was determined by that of the accented ones, except of
course at certain pause points. But I think we must now go on and ask
whether that is true, whether changes in unaccented syllables – while
accented ones remain the same – affect the meanings of utterances.

Returning to examples 10 and 11, I recall suggesting that after 10 the
subject might well be dropped, while after 11 – and this was expanded in
12 – something more would be expected. In 11, not to try to describe it in
more definite terms, I think we can say that there is a sensation of being
keyed up. This is even more noticeable in 25 and 26 as against 23 and 24.
To take a fresh example, imagine someone being asked two questions:
[...] living as a cripple, which would you prefer? and How [...] living
your whole life as a cripple? Imagine being given two


never you mind means ‘My time will come, you'll catch it no matter what,’ and J don't 
deese him means *Nothing will shake my personal doubt." А 
ice members baye neither an overall rise nor an overall fall. The first accent i$ 
Ria continue m of the rise, the second is approached by a fall. The attitude 
pL EE ц eoft e first two, with the second predominating because it comes 
US TE RUE g something like This is suggested as something to be aroused about, 
possessed about it.’ It is typical of ho-hum exclamations (Glory be, well 


what do you know. If it isn't San i 
› 2 m Sneed) i 4 
importance the speaker dismisses, мы тушш ae Wi 


to to 
ing 
"There's i) w hap” 
or Гл do 
ry about, it. 
things of which the speaker disclaims responsibility, suggesting ‘I don't саге": 
Y to 
е same · 
to the >> 
go dev all i 


N it 
He can It’s m Take leave 


il. e it. 


With this intonation an utterance like Don't you worry would not be said to placate th? 
hearer about his own worries. It would be said either to mean ‘You don't worry me 


(Don't you worry; I'll get you yet) ог ‘You (or I) don’ a 
d ту: on't nee thie 
from X’ (Don't you worry, I'll take care of that big bully). Wir ERU 


The contrast between overall rise and overall fall is the [...] Schubiger's
article (1967). It contains many examples from British [...]




answers to match the two questions, and then decide which would go best 
with which: 
di 


e; 
If I can trust my own reactions, 46 fits the second question and might
possibly be used with the first, but 47 is more than a bit unnatural with the
second question. The first question poses a logical choice; the second calls
for feeling. What signals the difference in the answers is the behavior of the
unaccented syllable. I would like to characterize its function as that of
showing the speaker's involvement with the idea. When his feelings get the
better of him (or he wants to pretend that they do), the unaccented syllables
maintain a high pitch.

What now of the effect of raising the pitch on an accented syllable? This is
gradient and we can hike it up as much as we please. In place of 47 we can
have

48. [pitch diagram]

I submit that the difference now is not the speaker’s involvement with the
idea, but rather the expressivity of his message. Where the sustained high
pitch on unaccented syllables was as it were an uncontrolled factor, letting
through the message the speaker’s feeling about the idea, extra-high pitch
on the accented syllable against a background of low-pitched unaccented
ones is a controlled factor. It enables the speaker to underscore some fact
of his message as new or surprising. It is as if lowering the pitch on the
unaccented syllables – when everything prompts one to raise it – were
saying ‘Look, I’m in control of the situation; any raises I use are for my
purposes.’ This is crude, and I will restate it just as crudely: high pitch with
a background of low pitch serves the speaker; overall high pitch betrays him.

I use the term ‘controlled’ rather than ‘logical’ because the meanings
of intonation are too all-embracing to be confined to logicality. Nevertheless,
cases of logical emphasis are the best examples of what I mean by
‘control’. If in answer to Why are you so partial to older people? one heard

49. [pitch diagram: The young are untrustworthy.]

rather than

50. [pitch diagram: The young are untrustworthy.]

the logical contrast on young would be lost, and the speaker would seem to
have been carried away by his feelings.

As we range over more intonation patterns, the usefulness of the notion
of ‘control’ becomes more apparent. For the speaker has it within his
power to turn everything upside down. This is clearest with the inverse
accents, in which accented and unaccented syllables simply exchange
places – high-pitched unaccented syllables convey involvement, low-pitched
accented ones blunt the impact of whatever facts are reported or demands
made and the rising terminals remove any impression of assertiveness. So
these are used to reassure, to wheedle, to object without offence:

Не ... г. РЕ 

51. Не Agent T 52. Wouldn't it be yt 

mean a 
nW 
this 
bett! 


We need, of course, a law of truth in intonation as we do in lending. A 
HM in which the normally uncontrolled is used in a controlled way 
is a kind of falsehood. So reversed accents are apt to be taken as conveying 


formalized, hence insincere, emotion. There are other similar cases.9
To summarize:


1. The height of an accented syllable by comparison with that of another
accented syllable produces the same effect regardless of the height of


9. For instance, putting exaggerated high pitch, extra length and loudness and
[...] (or glottalization of vowels), including sometimes a rise-fall-rise, on a
normally unaccented syllable. The basis for this is seen in the type


co, a do 


d 
How 1 you 
it! 
in which accented could gets the treatment. It then is carried over to unaccented 


syllables. The effect is one of great involvement И incet? 
i D check: since 
exclamations are especially common: ed by great restraint. In 


` i, the dy you d of my 
What How Out 
ld! о! 


ay, sirt 


the unaccented syllables. An overall rise contrasts with an overall fall.


2. The height of an accented syllable contrasting with a following un- 
accented one conveys the impact of the message — its logical import, its 


informativeness. 
3. The height of unaccented syllables conveys the involvement of the
speaker. A return to low-pitched unaccented syllables implies control.


These effects combine. An utterance like

[pitch diagram: You shouldn't worry.]

has high involvement and no message impact – the whole purpose is to
show the speaker's concern and minimize the importance of worrying.
One like

[pitch diagram: You shouldn't worry.]

has no involvement and high message impact. It presents in a contrastive
way the idea of not worrying. One like

[pitch diagram: You shouldn't worry.]

shows both involvement and high message impact. These interpretations
are still pretty hypothetical, but they are a stab at getting some kind of
semantic interpretation of intonational elements independent of the
particular utterances that they fall on (including utterance types like question
and statement), and independent of the particular combinations, including
combinations with one another and combinations with other prosodic
features, in which they occur.

10. See footnote 8 for my interpretation of this.


Reference

Schubiger, M. (1967), ‘A note on two notional functions of the low-falling
nuclear tone in English’, Engl. Stud., vol. 48, no. [illegible].




Part Three
Intonation and Grammar 


Probably the most important grammatical function of intonation in the 
language family to which English belongs is that of tying the major 
parts together within sentences and tying sentences together within 
discourse — showing, in the process, what things belong more closely 
together than others, where the divisions come; what is subordinate 
to what, and whether one is telling, asking, or commanding. Pierre 
Delattre surveys these uses in French. His study is not only valuable for 
relating intonation to grammar but also for revealing the underlying 
kinship between the intonations of Western languages. What he has 
to say about *major and minor continuations’, for example, can be 
applied to English if it is seen in relation to other possibilities. English 
has two ways of separating the clauses of a sentence. One of them is
much more the rule in English than in other European languages; it
consists in dropping the pitch at the break but then letting it rise slightly:

[pitch diagram: If he returns both of them, I'll refund the entire amount.]

– the fall here is from both to of and the rise is on them. This intonation
is used when the speaker intends the first clause to be viewed as a new
idea. But if it only repeats what has gone before, then English tends to
use the same curve that Delattre describes for French. Imagine the
above example said in answer to the question What do I do if he
returns both of them? It would then probably be pronounced

[pitch diagram: If he returns both of them, I'll refund the entire amount.]

– a simple rise all the way to the comma. This same intonation is
common on other expressions that are ‘not new’, such as folk sayings:


Easy come, easy go; One for the money, two for the show; Ask me no
questions and I'll tell you no lies. Delattre was always as much interested
in teaching as in experimenting and theorizing, and his treatment of
French intonation has a clarity that is ideal for a textbook, drawing
effectively and eclectically from both camps in the intonation
controversy, the level camp and the contour camp.

A number of problems in syntax are easier to solve if it is assumed
that at a level above the sentence itself there is a super-sentence
meaning something like ‘I assert this’, ‘I ask this’, ‘I order this’,
according to the mood of the speaker. Sometimes it is explicit: we can
actually say I assert that he did it rather than just He did it. More
often only a fragment of the higher sentence is there, usually in the form
of a ‘sentence adverb’ such as truly, generally, hopefully. But usually
in English no actual words betray its existence, and where the words
are missing, intonation fills in. English uses numerous patterns for
assertions, questions, commands and exclamations. It is not to be 
expected that all languages will make equal use of verbal signals on the 
one hand, and intonation on the other, for this purpose. In the second 
Reading Maria Schubiger shows how a favourite device in German, the 
modal particles, is paralleled by intonation in English. 

The question of whether to be broad or deep plagues every scientist. 
The solution is to be both, but that is impossible in practice because no 
one has time. The formal grammar of the 1960s concentrated on the
sentence as a self-contained structure and taught us much about it that
we could not have learned otherwise. But it sacrificed the broader
vision of the sentence in a community of sentences, or discourse. Of
late this has begun to be remedied; such things as presuppositions,
coreference, and presentatives are seen as linking sentence to sentence.
An important use of intonation is to mark a relationship of this kind
between a sentence and the one that precedes it. Richard
Gunter's article, published here for the first time, [...]
commonest of the intonation contours are [...], and
incidentally has something to say about the reality of intonation
phonemes. The notation used is that of Trager (pp. 83-6).

The part of grammar with which intonation cooperates most
consistently is that of word order. When a sentence adverb is moved
from the beginning to the end of a sentence, the intonation usually
tells us so; hearing

[pitch diagram: The play ended, happily.]

rather than

[pitch diagram: The play ended happily.]

we are pretty sure that the meaning is the same as that of Happily, the
play ended. But where intonation is most useful is with sentences in
which the order of words cannot be changed – Mary loves John is not
convertible to John loves Mary, as that would alter the meaning – and
still the speaker wants to make a particular word prominent. The
grammar does not permit him to do it by moving the word to a
prominent position, for example to the end of the sentence; so he
resorts to a change in pitch. This device is so common in English that
it adds substantially to the impression of English as a language
exceptionally rich in its use of intonation. An extreme example is

[pitch diagram: I don’t want to tell him he isn’t the kind of person I would
care to spend the rest of my life with.]

with everything after want at a low-level pitch – to convey the meaning
that all of that is more or less understood from what has already been
said, and the real focus of information is on what the speaker wants.
In the last Reading, František Daneš surveys the possibilities and
limitations of word order, and then shows how intonation makes up
for deficiencies. His work is representative of the Prague School of
linguistics, whose members have made ‘sentence perspective’ – how
meanings and their communicative weight or importance are distributed
along the length of a sentence – a principal concept in their theories of
language.


8 Pierre Delattre 


The Distinctive Function of Intonation 


from Pierre Delattre, The General Phonetic Characteristics of Languages, University
of California at Santa Barbara, prepublication of Research Contract
OEC-4-7-061990-0176, with the United States Office of Education, 1966-7,
pp. 81-102.


Intonation is the salt of an utterance. Without it, a statement can often
be understood, but the message is tasteless, colorless. Incorrect uses of it
can lead to embarrassing ambiguities.


Arrêtez le voleur! (Arrest the thief!) 


could be understood as 
Arrêtez-le, voleur! (Arrest him, you thief!)


if the intonation curve does not keep falling until the very end. 


Vous l'appelez imbécile? (Are you calling him an idiot?)
might offend the listener and be heard as
Vous l'appelez? Imbécile! (Are you calling him, you idiot?)

unless the rise of the curve increases to the end.
Distinctions of meaning that are due to differences in intonation
curves are not always so clear as the ones above, however. That is
why two schools have appeared. For some, intonation is truly linguistic;
differences of level and terminal shape in voice inflexions carry meaning.
For others, intonation contours are not truly distinctive, they merely
reflect the attitude of the speaker.
For the latter, between the encouraging statement

Il est intelligent, celui-là. (He is intelligent, that one.)

said with a continuously falling pitch on intelligent, and the ironic one

Il est intelligent, celui-là. (He is intelligent, that one.)

said with a characteristic rise on lli and fall on gent, an infinite number of
nuances of meaning are possible, which all reflect differences of attitude
but cannot be categorized into discrete units. To obtain linguistic changes
of meaning would require some change at the segmental level:

Il fut intelligent, celui-là. (He was intelligent, that one.)
Est-il intelligent, celui-là? (Is he intelligent, that one?)


For the other school, every intonation contour has a distinctive function 
and can be classed in one of the families of contours often called pitch 
phonemes, or pitch prosodemes, or intonemes. Then, 


Vous sortez. (You are going out.) 
with a falling contour, is meaningfully distinct from 
Vous sortez? (You are going out?) 


with a rising contour, to the same extent as 

Vous sortez. (You are going out.) 

with the pronoun preceding the verb, is meaningfully distinct from 
Sortez-vous? (Are you going out?) 


with the pronoun following the verb, 


Without taking sides with either school, we shall find ourselves closer
to the distinctive-families point of view because we shall concern ourselves
here only with the ten most frequent and most clearly defined intonations
of French.

But first, to orientate ourselves and realize that a variety of pitch
contours can play a role in communication, let us transform a few segmental
sequences by exclusively changing the suprasegmental curves.

Everyone can hear what a gruesome meaning is given to the sentence 


What shall we have for dinner, mother? 


when mother is said on a sharply rising intonation, instead of on a low
plateau. Similar blunders can occur in French. Figure 1 offers a few
examples of such transformation by pitch. Patterns 2, 3, 4 and 6 might
result from a ‘cannibal attitude’! 2 asks, ‘Shall we eat mother?’, 3 answers,
‘Yes, we shall eat mother,’ 4 explains that it is obvious, ‘We shall eat
mother (of course). Who else is there to eat?’ In 5, the rising pitch, taking
the place of the falling pitch of 1, changes the question to ‘Are you asking
me, mother, what we shall have for dinner?’ And 6, not addressed to
mother any longer, means, ‘Are you asking me what we shall have for
dinner? I am answering you that we shall have mother.’
Figure 2 offers another segmental sequence which lends itself to
transformation by pitch. Examples 2, 4, 6 and 8 could again result from a [...]
of cannibalism.




1 Qu'est-ce qu'on a pour le dîner, maman?
2 Qu'est-ce qu'on a pour le dîner? Maman?
3 Qu'est-ce qu'on a pour le dîner? Maman.
4 Qu'est-ce qu'on a pour le dîner? Maman.
5 Qu'est-ce qu'on a pour le dîner, maman?
6 Qu'est-ce qu'on a pour le dîner? Maman.

Figure 1 [pitch curves not reproduced]


Figure 3 illustrates the tortures that can be inflicted upon a segmental
sequence by varying the pitch contours. The question of line 1 is answered
in seven different ways, all differently meaningful. And lines 9 and 10
present two more ways in which the same sequence can be understood.
Let us now see how order and objectivity can be found in this apparent
labyrinth of intonation curves. It can be done by applying to the study of
pitch contours the rigorous method of phonemic opposition in minimal
pairs. This method yields the segmental phonemes of a language by
substitution of one segment in a sequence. Thus, the oppositions of meaning:
épée, été, aidé, aîné, ailé, effet, essai, aisé, show that, in French, the phones
/p, t, d, n, l, f, s, z/ are distinctive consonant phonemes capable of
producing a change of meaning; the oppositions: pire, pure, pour, père, peur,
port, part, demonstrate that the phones /i, y, u, ɛ, œ, ɔ, a/ are distinctive
vowel phonemes.

Similarly, substitutions of pitch curves should yield the distinctive
intonemes of French. Illustrations of such substitutions are presented in
Figures 4 and 5 by means of ten pairs of utterances.
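The substitution test described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the toy lexicon, its simplified phoneme tuples, and the function names `differing_slot`, `minimal_pairs` and `distinctive_segments` are hypothetical and are not part of Delattre's text.

```python
from itertools import combinations

# Hypothetical toy lexicon: each word is mapped to a simplified phoneme tuple.
# The transcriptions are rough assumptions made for illustration only.
LEXICON = {
    "épée": ("e", "p", "e"),
    "été": ("e", "t", "e"),
    "aidé": ("e", "d", "e"),
    "aîné": ("e", "n", "e"),
    "ailé": ("e", "l", "e"),
    "pire": ("p", "i", "r"),
    "pure": ("p", "y", "r"),
    "pour": ("p", "u", "r"),
}


def differing_slot(a, b):
    """Return the index of the single differing segment, or None."""
    if len(a) != len(b):
        return None
    diffs = [i for i, (x, y) in enumerate(zip(a, b)) if x != y]
    return diffs[0] if len(diffs) == 1 else None


def minimal_pairs(lexicon):
    """List (word1, word2, segment1, segment2) for every minimal pair."""
    pairs = []
    for (w1, p1), (w2, p2) in combinations(lexicon.items(), 2):
        slot = differing_slot(p1, p2)
        if slot is not None:
            pairs.append((w1, w2, p1[slot], p2[slot]))
    return pairs


def distinctive_segments(lexicon):
    """Collect every segment that distinguishes at least one minimal pair."""
    segments = set()
    for _, _, s1, s2 in minimal_pairs(lexicon):
        segments.update((s1, s2))
    return segments
```

The same procedure carries over to intonation if each utterance is represented as a sequence of sense-group contours instead of segments: two utterances then form a minimal pair when exactly one contour slot differs.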




1 Jean-Marie va manger, mon enfant.
2 Jean-Marie va manger mon enfant.
3 Jean-Marie, va manger, mon enfant.
4 Jean-Marie, va manger mon enfant.
5 Jean-Marie va manger, mon enfant?
6 Jean-Marie va manger mon enfant?
7 Jean-Marie va manger, mon enfant,
8 Jean-Marie, va manger mon enfant.

Figure 2 [pitch curves not reproduced]


1 Qui prenez-vous dans votre auto?
2 La sœur de Jacques Laval et vous.
3 La sœur de Jacques, Laval, et vous.
4 La sœur de Jacques Laval et vous.
5 La sœur de Jacques Laval; et vous?
6 La sœur de Jacques Laval; et vous?
7 La sœur de Jacques Laval; et vous?
8 La sœur de Jacques, Laval;
9 La sœur de Jacques, la valez-vous?
10 La sœur de Jacques La Vallée? Vous?

Figure 3 [pitch curves not reproduced]


1 Anne-Marie va travailler.
2 Anne-Marie va travailler?
3 Anne-Marie va travailler.
4 Anne-Marie, va travailler.
5 Elle demande qui va rentrer.
6 Elle demande: "Qui va rentrer?"
7 Elle a dit quel scandale.
8 Elle a dit: "Quel scandale!"
9 Il a vendu son château en Espagne.
10 Il a vendu son château en Espagne.

Figure 4 [pitch curves not reproduced]


11 Si les prix montent encore, on sera forcé d'emprunter.
12 Si les prix montent encore, on sera forcé d'emprunter.
13 Qui va venir, Anne-Marie?
14 Qui va venir, Anne-Marie?
15 Elle prétend qu'elle refusera, la méchante.
16 Elle prétend qu'elle refusera, la méchante?
17 Elle prétend qu'elle refusera.
18 Elle prétend qu'elle refusera, [...] méchante.
19 Sept enfants, c'est bien faire?
20 Cet enfant sait bien faire?

Figure 5 [pitch curves not reproduced]


In examples 1 and 2 of Figure 4, we oppose the expression of finality to
that of question, by means of a question and an answer. Examples 1 and
2 form a minimal pair because the segmental content (the vowel and
consonant phonemes) is the same and the first intonation contour is the
same in both examples; the difference of meaning between 1 and 2 which
makes one understand 2 as a question and 1 as an answer (or as a statement)
is only a replacement of the second intonation contour. All changes of
meaning that will follow in successive pairs of utterances will be called
‘minimal pairs’ if the difference of meaning depends upon a replacement
of intonation contour in only one of the sense-group slots.
The first contour of 1 and 2 (Figure 4), Anne-Marie, is mildly rising. We
don't need to know more than that for the moment; later, this mildly
rising contour will be opposed to another one and its distinctive function
will be made clear; here, we use it merely as a point of reference and since
it rises as little as possible but does not seem to start from the lowest
possible pitch level, we assume that it rises from level 2 to level 3.
If we try to utter the second element of examples 1 or 2, va travailler,
with various degrees of RISE in the intonation, we note that in order to
understand the sentence as a question, va travailler must rise higher than
Anne-Marie; it must therefore rise to level 4; the level at which this rise
of va travailler should start is not relevant – we shall call it level 2
arbitrarily on the basis of phonetic analysis of the curves – the only relevant
level is the one at the end of the rising curve.
If we try to utter va travailler with various degrees of FALL in the
intonation, we note that in order for the sentence to be heard as a statement,
va travailler must fall lower than the last syllable of Anne-Marie (level 3).
This could mean a fall to level 2 or to level 1; the next minimal pair
(examples 3 and 4) will tell us which.

In examples 3 and 4, we oppose FINALITY to COMMAND by the
substitution of intonation contours in a single slot, that of va travailler. In
order that va travailler be heard as a command, the contour must fall and
the fall must start higher than the level-3 ending of Anne-Marie (which we
continue using as a reference), that is at level 4.

If we lower the level of the start of this falling contour in small steps, we
note that when it approximately coincides with level 3, the meaning is
ambiguous – va travailler is heard neither as a command nor as a finality.
To understand va travailler as finality, without any possible confusion with
command, the contour must start lower than level 3, therefore at level 2.
And since it is a falling contour, it must fall from 2 to 1.

The ending level of the command fall is less relevant than its starting
level. We shall call it 1 because phonetically, it tends to coincide with the
ending level of the finality contour (2-1).

In examples 5 and 6, we oppose FINALITY to INTERROGATION, also
called ‘information question’ or ‘falling question’, by means of direct-
discourse and indirect-discourse utterances. The change of meaning is
produced by contour substitution in a single slot, that of qui va rentrer,
the contour of the other slot and the whole segmental content remaining
fixed. To take the meaning of interrogation rather than finality, the falling
contour of qui va rentrer must start higher than the 3-ending of elle demande,
therefore at level 4. The end of the fall is not relevant; it is arbitrarily set
at level 1 because it generally coincides with the end of finality, according
to acoustic data.

It is interesting to note that the qui va rentrer of this sequence, in direct as
well as indirect discourse, could be said with a rising (2-4) contour. In
that case it would not express interrogation. It would be the end of a
question bearing not on qui va rentrer but on elle demande, even though
elle demande could only have a 2-3 rise. The two terms of the opposition
would then mean: Is she asking who will return? (indirect discourse), and:
She is asking: ‘Who will return?’ (direct discourse).

We have just described two contours which fall from level 4. There is
one more, in French – the contour of exclamation (or at least its most
typical realization, for exclamation enjoys more contour freedom than
other expressions).

Examples 7 and 8 oppose finality to exclamation by means of an indirect-
style utterance, meaning: She divulged what a scandal had been caused, and
a direct-style utterance, meaning literally: She exclaimed: ‘What a scandal!’
To be heard as an exclamation, quel scandale must start at a higher level
than the 3-ending of elle a dit, therefore at level 4.
Acoustic analysis of the three falling contours whose distinctive
functions have just been defined shows regular, but small, differences among
them. Those differences are schematized, in Figures 4 and 5, by a
decreasing fall for interrogation, a straight fall for command, and an
increasing fall for exclamation. However, those differences are too small to be
perceived easily. Auditory tests of those pitch curves after the word content
had been filtered out gave negative results with naive listeners; only trained
phoneticians were able to distinguish among the three curves. For this
reason we should perhaps call those three falling curves non-distinctive
among themselves but only distinctive in regard to all the others. We can
assume that, if sharper intonation differences among those three contours
have not developed, it is because the grammatical differences are generally
clear enough to require no prosodic help.
Whereas finality contours are falling, continuation contours are rising,
in French. This is as it ought to be – one should easily hear, at the end of
each sense-group, whether the sentence is concluding or continuing. But
there are two different contours to express continuation in French. One
rises higher than the other. The greater rise is called major continuation;
the smaller one minor continuation. Examples 9 and 10 oppose minor
continuation to major continuation. Here, however, a direct opposition of
the types used in the preceding minimal pairs is not possible and we must
have recourse to the next best procedure, which is a crossed opposition.
Minor continuation rises to level 3 and major continuation rises higher,
therefore, to level 4. This difference of meaning between examples 9 and 10
demonstrates that a minor rise to level 3 and a major rise to level 4 have a
distinctive value. When il a vendu rises to level 3 and son château rises to
level 4, it is clear that the owner is returning from Spain, where he sold his
castle (a castle that could well be in France). But when the contours are
exchanged, that is, when il a vendu rises to level 4 and son château to level 3,
it means that the castle he sold is in Spain (and the sale could have taken
place in France). Obviously, the division into immediate constituents occurs
after a major continuation rather than a minor one.
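The level reasoning developed so far can be summed up as a small lookup. The sketch below is a hypothetical Python rendering, not Delattre's own formalism: the function name `classify_contour` and the decision to return a set of candidate functions are assumptions. It encodes only what the text states: for rises the ending level is the relevant one, for falls the starting level is, and a fall that starts at level 3 is ambiguous between command and finality.

```python
def classify_contour(start, end):
    """Return the set of functions compatible with a (start, end) level pair.

    Levels run from 1 (lowest) to 4 (highest). Several functions may share
    one level pair, so the result is a set of candidates, not a single label.
    """
    if not (1 <= start <= 4 and 1 <= end <= 4):
        raise ValueError("levels must lie between 1 and 4")
    if end > start:
        # Rising contour: only the level reached at the end is relevant.
        if end == 4:
            return {"question", "major continuation"}
        if end == 3:
            return {"minor continuation"}
        return set()
    if end < start:
        # Falling contour: the starting level carries the distinction.
        if start == 4:
            return {"command", "interrogation", "exclamation"}
        if start == 3:
            # Reported as ambiguous between the two readings.
            return {"command", "finality"}
        if start == 2:
            return {"finality"}
    # Level contours (plateaus) are not covered by this sketch.
    return set()
```

Returning a set keeps visible the point made in the text that the three falls from level 4 are hardly distinguishable by ear, and that a rise to level 4 serves both question and major continuation, so that grammar or finer curve shape has to do the rest.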
Examples of changes of meaning that are produced by exchanging the 


place of major continuation contours with that of minor continuation 
contours are common. 


Il a peint (4) la jeune fille (3) en noir (1). 

means that he made a portrait of a girl wearing black clothes.

Il a peint (3) la jeune fille (4) en noir (1). 

could mean that he covered a girl with black paint. 

Il a demandé (4) qui écrivait (3) à sa fille (2)? 

means that he wanted to know who it was that wrote to his daughter. 
Il a demandé (3) qui écrivait (4) à sa fille (1). 


means that he asked his daughter to tell him who was writing. 


Similar oppositions occur in English, but they are based on the place of 
stress more than on intonation contours. 


They decorated || the girl | with the flowers.
is not the same as
They decorated | the girl || with the flowers.

But the main distinctive function of the major continuation rise is not
made entirely clear by examples of crossed oppositions such as the ones
given above. It appears even better in ‘echelon’ series of sense groups, as
below.
Si Anne-Marie (3) vient nous voir (4), on sera là (1).
Si Anne-Marie (3) vient nous voir (3) demain matin (4), on sera là (1).
Si Anne-Marie (3) vient nous voir (3) demain matin (3) pour le déjeuner (4),
on sera là (1).
Si Anne-Marie (3) vient nous voir (3) demain matin (3) pour le déjeuner (3)
sur la terrasse (4), on sera là (1).

The function of the major continuation contour seems to be to unite
several small units of meaning into one larger unit of meaning which does
not end the sentence. Here, the rise to level 4 indicates that all the small
sense-groups, from Si Anne-Marie to sur la terrasse, belong to the large
unit of meaning which ends on the word terrasse.
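The echelon series follows a regular rule that can be written out as a short Python sketch. The names `echelon_levels` and `render` are hypothetical, and the rule is only the one visible in the four examples: every sense-group of the larger unit ends at level 3 except the last, which ends at level 4, and the closing clause ends at level 1.

```python
def echelon_levels(groups, closing_clause):
    """Pair each sense-group with the level on which its contour ends.

    groups: the sense-groups that make up the larger, non-final unit.
    closing_clause: the clause that ends the sentence (finality, level 1).
    """
    if not groups:
        raise ValueError("at least one sense-group is required")
    marked = [(group, 3) for group in groups[:-1]]  # minor continuations
    marked.append((groups[-1], 4))                  # major continuation
    marked.append((closing_clause, 1))              # finality
    return marked


def render(marked):
    """Format the result in the 'text (level)' notation of the examples."""
    return " ".join(f"{text} ({level})" for text, level in marked)
```

For instance, `render(echelon_levels(["Si Anne-Marie", "vient nous voir"], "on sera là"))` gives the marking of the first example, without its punctuation.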
Examples 11 and 12 (Figure 5) do not present an opposition. They are
given to illustrate a peculiarity of the minor continuation contour, namely
the fact that when it precedes a higher pattern (and only then) it can fall,
as in example 12, as well as rise, as in example 11. It can fall when followed
by a higher pattern, as in si les prix, but it must rise when followed by
a lower pattern, as in on sera forcé. It should be noted, however, that
minor continuation seldom takes that falling shape, and takes it for no
other purpose, perhaps, than to break the monotony of repeated rising
contours.

Examples 13 and 14 (Figure 5) oppose the question to the parenthesis.
After falling contours, and after the contour of [...], parenthesis is
expressed by a low plateau, close to level 1, as in example 14; to convey
that the words qui va venir are spoken to Anne-Marie and not about her,
Anne-Marie must be said at the level at which the interrogation
qui va venir? had ended.

Examples 15 and 16 compare (rather than oppose) two different levels of
parenthesis. After a falling contour, as in 15, the plateau is low,
but after a rising contour, the plateau is high, as in 16. We note,
then, that the parenthesis always takes the shape of a plateau, but the level
of the plateau varies according to the contour that precedes. Since the level
of the parenthesis plateau is thus conditioned, it can be said that the
various levels of parenthesis are in complementary distribution.
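Complementary distribution means that the plateau level can be predicted from the preceding contour and never has to be chosen independently. A minimal Python sketch follows, with assumptions flagged: the function name `plateau_after` and the two sets are hypothetical groupings. The text states outright only that falls are followed by a low plateau and rises by a high one; placing implication with the low group rests on the later remark that expressed implications take "the contour of low parenthesis".

```python
# Contours after which the parenthesis plateau is low (assumed grouping).
LOW_AFTER = {"finality", "command", "interrogation", "exclamation", "implication"}

# Rising contours, after which the plateau is high (assumed grouping).
HIGH_AFTER = {"question", "minor continuation", "major continuation"}


def plateau_after(preceding_contour):
    """Return 'low' or 'high' for the parenthesis that follows a contour."""
    if preceding_contour in LOW_AFTER:
        return "low"
    if preceding_contour in HIGH_AFTER:
        return "high"
    raise ValueError(f"unknown contour: {preceding_contour!r}")
```

Because the function takes no argument other than the preceding contour, it makes the conditioning explicit: the two plateau levels never contrast in the same environment.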
The parenthesis plateau has many uses. It is most frequent as the second
element in the structure of reduplication.

C'est lui (finality), le voleur.
Qui le veut (interrogation), ce livre-là?
Je le connais (implication), votre ami.
Il est là (question), Jean-Marie?
Finissez-le (command), ce morceau.

It is the intonation of the vocative:


Entrez-donc (command), Monsieur. 
Vous désirez (question), Madame? 
Que voulez-vous (interrogation), Anne-Marie? 
J'ai compris (finality), Jean-Pierre. 


It indicates a quotation: 


‘Attendez-moi,’ (command) dit-il. 
“Il est fou!’ (implication) s’exclama-t-il, 
“Il est là?’ (question) demanda-t-il. 


Examples 17 and 18 oppose finality to implication. The implication
contour normally shows a quickly decreasing rise which ends with an
embryo of a fall. The implied idea is generally not explicit, as in example
18; it is most of the time merely implicit, as in the next series of examples.

The contour of implication is used very frequently in everyday
communication. To a question such as,

‘Il est arrivé, (2) mon ami?’

one might answer with a series of implication contours, in order to
reassure the questioner,

Mais bien sûr . . .
Il est là . . .
Il vous attend . . .
Et avec impatience . . .

If the implications were expressed, the contour of low parenthesis would
be used:

Mais bien sûr . . . voyons (low parenthesis).
Il est là . . . votre ami.
Il vous attend . . . le brave type.
Et avec impatience . . . croyez-moi.


It is probably the first element of a grammatical reduplication that makes 
the best use of the implication contour: 


Je l'ai aperçu . . . votre ami.
Elle est connue . . . cette histoire.
J'en ai assez . . . de cette affaire.

Implication can replace finality.

J'ai vu Jean . . . (vous savez).
Il n'a pas compris . . . (le pauvre type).

It can replace command.

Donnez-le-moi . . . (s'il vous gêne).
Qu'il parte donc . . . (puisqu'il n'est pas heureux).


It can replace a question contour, the meaning being then radically changed
to a request for approval.

Vous viendrez . . . (n'est-ce pas)?
C'est bien vous . . . (qui l'avez fait)?

It can even replace an exclamation, to lend a flavor of mystery.

Quelle horreur . . .
Quel scandale . . .
Au secours . . .


Finally, examples 19 and 20 show that it is not impossible to oppose
minor continuation directly to another contour. But it takes a tour de force
of homophony. If sept enfants is uttered on the low plateau of parenthesis,
the phonemes /sɛtɑ̃fɑ̃sɛbjɛ̃fɛr/ are understood as: Sept enfants, c'est bien
faire? But when sept enfants is given a contour of minor continuation, the
same sequence of segmental phonemes is heard as: Cet enfant sait bien
faire?
We mentioned earlier that the three contours that fall from level 4 are not clearly distinctive among themselves. Now looking back at Figures 4 and 5 we note that we also have three contours that rise to level 4: question, major continuation and implication. But those three are fairly distinctive among themselves, according to auditory tests given to naive listeners. In the figures, they are given schematized shapes which roughly represent their objective variations: for the question, an increasingly rising curve; for the major continuation, a decreasingly rising curve; and for the implication, a decreasingly rising curve with a slightly falling appendix.

To indicate the distinction by symbols, we add terminals to the levels as follows:


Question: 2-4,
Major continuation: 2-4
Implication: 2-4.
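
The three schematized rising shapes described above can be sketched, very roughly, as a test on how the slope of a sampled pitch curve changes. The Python fragment below is only an illustration under stated assumptions: the function name `classify_rise`, the sample values and the use of simple slope comparisons are hypothetical and are not part of the analysis, which rests on auditory tests.

```python
def classify_rise(pitch):
    """Label a contour from equally spaced pitch samples (hypothetical units)."""
    if len(pitch) < 3:
        raise ValueError("need at least three samples")
    # Step sizes between successive samples.
    steps = [b - a for a, b in zip(pitch, pitch[1:])]
    # A rise whose very last step falls: the 'slightly falling appendix'.
    if steps[-1] < 0 and all(s > 0 for s in steps[:-1]):
        return "implication"
    if not all(s > 0 for s in steps):
        return "other"
    # Purely rising curve: compare the first and last slopes.
    if steps[-1] > steps[0]:
        return "question"            # increasingly rising
    if steps[-1] < steps[0]:
        return "major continuation"  # decreasingly rising
    return "other"
```

A call such as `classify_rise([2.0, 3.3, 3.8, 4.0])` treats the list as a rise from level 2 to level 4 whose steps shrink, which this sketch labels as major continuation.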


In summary, of the ten intonation contours that emerge from our analysis by oppositions in minimal pairs, only seven are clearly distinctive:

1. Question (2-4)
2. Major Continuation (2-4)
3. Implication (2-4)


[Figure 6. Schematized contours on sample utterances:
1 question: C'est là qu'elle tombe?
2 major continuation: C'est là qu'elle tombe et rien n'y fait.
3 implication: C'est là qu'elle tombe, la pauvre.
4 minor continuation: Qu'elle tombe ou non, peu importe.
5 high parenthesis: C'est là, qu'elle tombe?
6 low parenthesis: C'est là, qu'elle tombe.
7: C'est là qu'elle tombe.
8: Quelle tombe?
9: Qu'elle tombe.
10: Quelle tombe!]




[Figure 7. The ten contours in a brief dialogue:
1 minor continuation: Quand j'ai vu
2 major continuation: l'accident,
3 finality: j'ai pris peur.
4 exclamation: Quelle horreur!
5 command: Aidez-nous.
6 question: Vous ne voyez pas?
7 high parenthesis: Monsieur l'agent,
8 implication: Je vois fort bien,
9 low parenthesis: Chère Madame.
10 interrogation: Que puis-je faire?]




We can also illustrate those ten contours through a brief dialogue (Figure 7): ‘Quand j'ai vu l'accident, j'ai pris peur. – Quelle horreur! Aidez-nous. Vous ne voyez pas, Monsieur l'agent? – Je vois fort bien, chère Madame. Que puis-je faire?’


9 Maria Schubiger

English Intonation and German Modal Particles:
A Comparative Study

Maria Schubiger, ‘English intonation and German modal particles: a comparative study’, Phonetica, vol. 12, 1965, pp. 65-84; published by S. Karger, Basel.


Author's summary

After having established a certain parallelism between German modal particles and English emotive intonation, the author examines in detail the English tone patterns that can correspond to unstressed German doch in the sense of ‘By the way you talk one would think you didn't know.’ It is found that both the prenuclear tone and the nucleus can express this connotation, the former mainly by avoiding the neutral stepping head, the latter by a preference for a rise-fall, in some cases a rise, instead of the more neutral fall. Statements, commands and questions are passed in review.


The investigation of English intonation has reached a point where its form has been explored almost to perfection, but where the various attempts to assess its function have resulted in a mosaic of partly concordant, partly divergent opinions. Neither the description of French intonation nor that of German – to mention only two languages the present author is familiar with – has given rise to similar discussions. This is not only due to the fact that the intonation of those languages has been investigated less thoroughly than that of English – both American and British (‘Received Pronunciation’) – but also to the less prominent part played by intonation as sole bearer of subjective functions in these languages. In French there is a wealth of turns of syntax, ‘la syntaxe affective’, unknown to English (see Schubiger, 1935, p. 53); in German the so-called modal particles largely perform this task.1 Neither in French nor in German does this impair the role played by emotional intonation; but it is […] grammarian by-pass this delicate subject, since he can base his […] expression.

1. […] – more or less summarily – by both lexicographers and grammarians. Then comparisons with other languages have been […]. Arndt […] them with the similar Russian modal particles, Collinson (1954) with the corresponding English means of expression. Although there are fewer modal particles […] equivalents can be found in most cases. But in […] tendency to do without them. This situation is also reflected […] German works of literature. In the books […] (n.d.) in her recent dissertation on doch and its […] of the doch particles have been […]

The present writer has on more than one occasion pointed to the semantic correspondence between German particles and English tone patterns (Schubiger, 1935, pp. 32, 36-8; 1958, pp. 44, […]). It is the purpose of this paper to make a further contribution to the […]. The great number of German particles, both simple and combined, makes it possible for the speaker to put into words practically every shade of feeling he wants to express. The elocutional means on which the English speaker heavily relies when urged to express his feelings, though just as expressive and differentiated, are much more elusive. At the base there is one of the two fundamental tone patterns: a nuclear fall (F) connotes finality, a nuclear rise (R) or fall-rise (FR) need of supplementation. With every sentence type one basic tone pattern can be considered neutral (R for general questions, F for all other sentence types), i.e. not connoting anything beyond what is expressed by the words, the other marked, i.e. conveying some additional meaning. The latter can only be gathered from the contents and context of the sentence. Roughly speaking, it can also be gathered from the sentence type. It has often been stated, e.g., that R instead of neutral F makes imperatives into requests and special questions into requests for information. The prenuclear patterns, too, can be either neutral or marked.2
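
The neutral/marked rule stated in this paragraph can be written out as a small lookup. The Python sketch below is purely illustrative: the names `NEUTRAL_NUCLEUS` and `is_marked` and the sentence-type labels are assumptions introduced here, and any nucleus other than the neutral one (R, FR, and so on) is simply treated as marked.

```python
# Neutral nuclear tone per sentence type, as stated in the text:
# R for general questions, F for all other sentence types.
# The dictionary keys are hypothetical labels chosen for this sketch.
NEUTRAL_NUCLEUS = {
    "general question": "R",
    "special question": "F",
    "statement": "F",
    "imperative": "F",
}


def is_marked(sentence_type, nucleus):
    """Return True when the nucleus differs from the neutral one for this type."""
    return NEUTRAL_NUCLEUS[sentence_type] != nucleus
```

For instance, `is_marked("imperative", "R")` is true, matching the remark that R instead of neutral F turns an imperative into a request.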


It is interesting to note that when we set about assessing the meaning of the German particles, which, it would seem, was an easier task, we are faced with similar difficulties. The precise meaning of the particle can in many cases be gathered only from the contents and context of the sentence, and the various meanings often fall into groups that correspond to sentence types. Three examples will illustrate this.

2. […] more closely, […] is voice quality ([…]ess or softness, etc.), partly a modification […] the tone pattern itself: large[…] average pitch (called […]). […] made the closest study of this subject, which […] under the title ‘Modifica[…]’ ([…] pp. 99-101). Pike considers these phenomena […]. Catford (1964, p. […]) speaks of the paraphono[…] in speech, as opposed to […] phonological and the non-[…]




1. nur

Imperative
Hab nur keine Angst. Warte nur. Laß mich nur machen.
Tone of reassurance. With other voice-quality: threat (second sentence).

Question
Wie kommst du nur nach Hause? Doubt.

Statement
Ich muß mich nur wundern, daß er es aushält. Emphasis.

2. doch

Retort
A. Wieso kann Jacques denn so gut deutsch?
B. Er ist doch Elsässer. doch = as you should know.

Exclamatory statement
Der Hans ist doch ein Schlaumeier.

3. schon

Reassuring statement. Reaction to interlocutor's doubtful or worried utterance.
Ich werde schon aufpassen. Er wird schon darüber hinwegkommen. Die Firma wird es schon zahlen.

Calculating statement, pointing to a minimum requirement.
Ich werde schon aufpassen müssen. Drei Tage wirst du schon rechnen müssen. Zehn Franken wird's schon kosten. Wir werden's schon runternehmen müssen.

There is a similarity between German and English also in this: just as the basic function of the English intonation pattern is felt more or less distinctly, or has faded altogether, the originally notional meaning of the German particles has been eclipsed to a greater or smaller extent; e.g.

English
It `isn't “bad (= but it is not very good either).
Here the implication, expressed by the FR, is felt quite clearly.
Mind you don't " fall.
Here there is no suggestion of something that might follow. The FR adds a warning note to the imperative.

German
Ist es auch wahr? (= You say it. But is it also true?)
Was der Kerl auch für Einfälle hat!
Here the notion of addition has disappeared […] purely emotive.


In consequence the insertion of a German particle or the use of a certain English nuclear tone is in some cases essential. English It `isn't “bad is notionally different from It 'isn't ‘bad, German Ist es auch wahr? from Ist es wahr? In other cases – chiefly exclamatory sentences – one can make free use of these means of expression, their notional function having completely faded. There is no difference of meaning between

German: Bist du elegant! and Bist du aber (or mal) elegant!
English: "You ,are ,elegant and “You „are ,elegant.

In view of this parallelism between German particles and English tone patterns it is tempting to compare the two means of expression in more detail. Yet within the scope of this article such a comparison can only cover a small and strictly limited section of the vast field. We shall make meaning our starting-point and consider rejoinders with the connotation: ‘by the way you talk (or act) one would think you didn't know (or were ignorant of the circumstances)’. This kind of utterance lends itself to a comparison, because the connotation is in English nearly always expressed by intonation alone, in German by an unstressed particle, in most cases doch.3 In English the tune with the nuclear F preceded by one or more low-pitched or gradually rising and therefore rather reduced innate stresses very often has the above connotation.4 This is not surprising. Reduced innate stresses often denote that these items are not new to the situation; they also tend to give the utterance an unpleasant ring; e.g. The ,clock has stopped again. I'm afraid I've up,set the ‘milk (O'Connor and Arnold, 1961, pp. 109-10, 120). Now by definition the particle doch we are here concerned with occurs in a sentence whose contents are not entirely new to the interlocutor. Moreover, unpleasant surprise, protest, censure easily colour an utterance which recalls to the interlocutor what he should know, or remember, or be aware of. Here are some statements with this tone pattern:

A. When can I have my typewriter back?

3. In his interesting book on the technique of translation Fritz Güttinger considers […] and bitte – as one of the tests of a good German translation. Indifferent translators […] no counterpart in English (1963, p. 148).

4. Allen gives both prenuclear patterns, saying that it is quite immaterial which we use, the rising trend being more usual when […] introductory unstressed group is long (1954, p. 68). Kingdon, too, gives both patterns, the level prenuclear intonation being pitched very low (1958, p. 52). In O'Connor and Arnold (1961) the tone gradually rises from the first semi-stressed syllable to the […]. We shall here make use of both ways of […].




B. I sent it to you ,three ‘days a,go (O'Connor and Arnold, 1961, p. 122).5
Ich hab sie Ihnen doch schon vor drei Tagen zurückgeschickt.

(To somebody who seems to have missed our previous remark.)
That's ,just what I ‘said (Palmer, 1922, p. 73).
Das hab ich doch eben gesagt.

(To somebody who does not see an obvious equivalence.)
That's the same ‘thing (Palmer, 1922, p. 73).
Das ist doch dasselbe.

(To somebody impatiently waiting for breakfast.)
I've only just got ‘up.
Ich bin doch eben erst aufgestanden.

(To somebody suggesting an invitation to complete strangers.)
We don't even ‘know their nationality (Jassem, 1952, p. 71).6
Wir kennen ja (or doch) nicht einmal ihre Nationalität.7

/Willie is a ‘friend of mine; we went to ,school to,gether (Frisch, 1962, p. 25).
Der Willi ist doch mein Freund; wir sind doch zusammen zur Schule gegangen (Frisch, 1959, p. 217).

I […] „making a noise when you ,eat (Frisch, 1962, p. 19).
Ich hab doch kein Wort gesagt, daß Sie schmatzen (Frisch, 1959, p. 213).

But your fifth (husband) was a ‘surgeon (Dürrenmatt, 1962, p. 51).
Aber dein fünfter war doch Chirurg (Dürrenmatt, 1956, p. 50).

Carpenter (in despair):
But they (the boards) are made to "measure (Brecht, 1957, p. 24).
Aber sie sind doch nach Maß gemacht […]

Housekeeper (to Shen Te):
I've no i-dea who you ‘are (Brecht, 1957, p. 17).
Ich weiß doch gar nicht, wer Sie sind (Brecht, 1956, p. 27).

5. The specimen sentences with a reference to […] two kinds of books: a) textbooks of intonation, above all O'Connor and Arnold (1961); […] the utterances are set in […]; b) […] and English works of literature, mainly modern plays, […] German […] translation […] doch […] without […] therefore intonation must express […] connotation, in the English original […]. It goes without saying that this marking is […] present writer. […] variants would be possible, just as other patterns are often conceivable with the […] tonetically […]. […] marking would have to be based […] with a great number of subjects reading the same passage, a laborious […] necessary for this comparative study of two languages.

6. We had to construct this and a few of the following contexts, as most authors give […]

7. […] (Er ist ja krank) […] (Er ist doch […]) […] force of stres[…] doch, the latter merely reminding the interlocutor of something he knows, see […] 1926, p. 920.


If the surprise or protest is vivid, the prenuclear section is modified in various ways. The rising movement is sometimes repeated, […] the prenuclear stresses have their full weight. The pre-head can be pitched high or low; e.g.

I've no i,dea who you ‘are.
We ,don't even ,know their nationality.

,Justice ,can't be ‘bought (Dürrenmatt, 1962, p. 36).
Die Gerechtigkeit kann man doch nicht kaufen (Dürrenmatt, 1956, p. […]).

But he ,wants to ,marry her to the ‘barber (Brecht, 1957, p. 60).
Aber der will sie doch mit dem Barbier verheiraten (Brecht, 1956, p. […]).


If the retort is chiefly a protest against the interlocutor's inconsistency, or similar contradictory attitude, the R nucleus is appropriate. The utterance verbalizes one element of the contradiction, the R nucleus implies the other. Here French pourtant renders the same meaning; e.g.

(To somebody who was pleased yesterday and is now complaining.)
It was all right ,yesterday. (So why complain to-day?) (Palmer, 1922, p. 78).
Gestern war's dir doch recht.
Tu étais pourtant d'accord hier.10

(To somebody reproaching us for inactivity.)
I've done all I ,can. (So why do you blame me?)


8. The connotation phoneticians have attributed to this pattern is remarkably uniform; O'Connor and Arnold (1961): querulous or disgruntled protest (p. 40). It is noteworthy that several of O'Connor and Arnold's drill sentences with this intonation (Tone-group 3) translate into a German sentence with doch; while none of the numerous statements with full prenuclear stresses (Tone-group 4) would in German suggest doch. Kingdon (1958): patient expostulation (p. 125). Jassem (1952): surprise, protest (p. […]). Palmer (1922): retorts (p. 73). Pike (1945) says of the rising pre-contour, which is relatively rare in American: protest (p. 68).

9. Also in German there can be a rising trend in the prenuclear part, […] for greater liveliness, e.g. Aber ich weiß doch gar nicht wer Sie sind (Collinson, 195[…]). Though the description of German intonation is outside the scope of this article, there will be an occasional reference to it.

10. Pourtant is not essential. The rising intonation with equal intervals throughout is the main bearer of this connotation. The same pattern with a greater interval between the penultimate and the ultimate syllable is interrogative (see Coustenoble and Armstrong, 1934, p. 56).




Ich hab doch mein Möglichstes getan.
J'ai pourtant fait tout ce que j'ai pu.

(To somebody who has burst into tears.)
[…] crying. You're no longer a child.
Schluß mit den Tränen. Du bist doch kein Kind mehr.

(To somebody complaining that he cannot enjoy the party.)
You ,weren't unhappy on the last occasion (Palmer, 1922, p. 78).
Das letzte Mal hat's dir doch gefallen.

A. […] let us joke any longer about . . .
B. We're not „joking (Frisch, 1962, p. 59).
Wir scherzen ja nicht (Frisch, 1959, p. 244).

(To the mate who went off to get wood-wool.)
What on earth is keeping you so long? Wood-wool is easy enough to get ,hold of (Frisch, 1962, p. 39).
Holzwolle ist doch keine Sache (Frisch, 1959, p. 228).

La Hire: The wind is against him. Bluebeard: How can the wind hurt him at Orleans? It is not on the Channel (Shaw, 1936, p. 119).
Orleans liegt doch nicht am Kanal (Shaw, 1925, p. 107).11


11. With this pattern, too, the connotations attributed to it by phoneticians […]: O'Connor and Arnold (1961): reproving criticism, resen[…] (pp. […]-6) translate in[…]. […] handicapped by not distinguishing the […] low rise going up only a little and the […] often reaching above the middle of the normal voice range. That is why they have […] the above connotations, which are expressed by a considerable rise, and ‘reserving judgement, guarded’, which […]. […] distinguishes the two types of rises by calling […] we are here concerned […] ‘[…] low rise’, with the connotation ‘im[…] or exasperation’ (1958, p. 222). Some […] very well be pictured […] that in German suggests doch. Jassem, too, […] nuclear tone (1952, p. 76) from the full-rising one, […] a low prenuclear tone, indicates ‘surprise, bewilderment or protest’ (pp. 75-6). Palmer also has the two rises, […] rise, always preceded by a rising head, […] low rise without head. He says of the one we are here […] ‘then why not . . .’. Many of his examples are retorts of […] 78). Halliday likewise distinguishes […] two rises, statements with the full rise being la[…] (1963, p. 22). The present writer has resorted to a makeshift, speaking […] a ‘relatively high-rising low rise’ (Schubiger, 1958, p. 44). Pike (1945) calls the low rise deliberative, the high rise (4-[…]) […] seem to the present writer, that are mutually exclu[…] question is outside the scope of this article.

Sometimes this tone pattern, especially with negative or restrictive statements, does not so obviously point to an inconsistency. It chiefly connotes protest, criticism. Here, too, German doch is appropriate; e.g.


A. […] have to sack him.
B. You ,can't do ,that (he is too useful) (O'Connor and Arnold, 1961, p. 49).
Das darfst du doch nicht tun.

With R statements, where, contrary to F statements, the direction of the nuclear glide suggests on its own a subjective connotation, the nucleus can fall at or near the beginning, without blurring the doch connotation. The reduced stresses, if any, occur in the tail, where they are incorporated in the nuclear tone movement and do not of their own accord contribute to the expressive value of the tone pattern; e.g.


A. What a wretched week it has been.
B. ,Yesterday was -not a -bad ‘day (O'Connor and Arnold, 1961, p. 173).
Gestern war's doch ganz schön.

A. Sixpence, for that small amount.
B. „Sixpence won't -break you (O'Connor and Arnold, 1961, p. 173).
Wegen Sixpence machst du doch nicht Pleite.

A. I don't think we ought to tell him.
B. ,Someone's -got to -do it (O'Connor and Arnold, 1961, p. 173).
Jemand muß es ihm doch sagen.

A. Oh, I do wish I could go.
B. ,I'm not -stopping you (O'Connor and Arnold, 1961, p. 173).
Ich halte dich doch nicht zurück.12


In addition to the emotive prenuclear intonation – or instead of it – a more emotional form of the nucleus can emphasize – or express – the doch connotation. A challenging or censorious attitude can be expressed by a rise-fall (RF) instead of a F; e.g.

A. Why didn't you tell me?
B. You didn't “ask me (O'Connor and Arnold, 1961, p. 157).
Du hast mich doch (or ja) gar nicht gefragt.


12. It is true that also utterances with an early fallin[g …] German doch-sentence, namely when the words […] here under discussion; e.g. I ‘told you it was ,foolish to \do it. Ich hab dir doch gesagt, es sei unklug. . . . Compare with this the followi[ng …]




A. Don't close the door.
B. But I wasn't “going to.13
Ich hab sie doch (or ja) gar nicht schließen wollen.

Where the censorious attitude is expressed by the RF nucleus, there is no need of a prenuclear part with this connotation; the nucleus may occur quite early, with the reduced stresses in the tail; e.g.

A. Could you give me a tip every now and then?
B. Why should I? Your ca^reer doesn't concern me.
Deine Karriere geht mich doch nichts an.

A. May I have some more trifle?
B. There “isn't any ,more. You've “eaten it ,all (O'Connor and Arnold, 1961, p. 157).
Du hast doch alles aufgegessen.14


The FR instead of an early R is rather a down-toner than an intensifier.

„Someone's -got to -do it. Jemand muß es doch tun.
I'm *not -stopping you. Ich halte dich doch nicht zurück.
I've *no -cause to be afraid. Ich brauch doch keine Angst zu haben.

are more mildly argumentative than ,Somebody's got . . ., ,I'm not . . .15


Also the RFR is possible, which tends to add a note of cheerfulness to the mildly argumentative FR; e.g.

(To a child afraid of bees.)
They won't -hurt you.
Sie tun dir doch nichts an.

(To somebody warning us.)
I've said -nothing "wrong.
Ich hab doch nichts Falsches gesagt.

Yesterday was "not a *bad "day.


13. This sentence i[…] Monfries (1963), Drill 34: ‘Protesting answers and unjust […]’. […] is instructed to use RF throughout.

14. One of the labels O'Connor and Arnold (1961) attach to RF statements is ‘challenging or censorious’ (p. 45). Kingdon says: ‘mocking or impatient; in some cases protest against a false assumption’ (1958, p. 221).

15. Cf. the down-toning effect of FR instead of R in flat contradictions, pointed out by O'Connor and Arnold (1961, p. 62): A. I can do it on Monday. B. You ,can't (as you ought to know perfectly well). You vcan't (and I am sorry that you should think you can). Kingdon (1958) gives the example: A. He hasn't taken them. B. ‘Yes he “has ([…]).




The preceding FR and RFR utterances have in common the fact that the nucleus comes early, with the reduced stresses in the tail. The final rise often occurs on one of these potentially stressed post-nuclear words and gives it secondary prominence. In all these cases of intensified or toned-down English tone patterns the German […] is […]. The varian[…], as in English, by elocutional means.

Statements with one or more reduced prenuclear stresses followed by a nuclear rise, as exemplified on pages 180-81, have a FR or rather F + R variant, too. The F mostly occurs on a negative or restrictive element of the sentence. This pattern adds a plaintive or pleading note to the protest; e.g.


A. I do wish he'd mind his own business.
B. But he was ‘only ,trying to be ,helpful (O'Connor and Arnold, 1961, p. 255).
Er hat uns doch bloß helfen wollen.

A. It's an absolute scandal.
B. There's ‘no need to ,get so worked ,up about it (O'Connor and Arnold, 1961, p. 255).
Du brauchst dich doch nicht so aufzuregen.

A. The amount of time one wastes there.
B. You ‘didn't have to wait ,long.
Du hast doch gar nicht lange warten müssen.
Cf. You didn't ,have to wait „long (O'Connor and Arnold, 1961, p. 185).


Here as elsewhere it is not always easy to keep FR and F + R neatly separate. Although slightly differing tone patterns have been claimed for them (Lee, 1956, p. 69), they are, to all intents and purposes, melodically identical. But while FR/doch […] takes us into the vicinity of concessive (limiting) anyway, jedenfalls, F + R/doch does not. It is purely emotional and could also be rendered by really, wirklich. Compare the following examples, all with similar wording, taken from O'Connor and Arnold (1961):


A. What a terrible waste of money!
B. ,You -didn't -lose by it (p. 174).
Du hast doch nichts dabei eingebüßt.

A. I'm sorry about the mess.
B. "You -couldn't help it (p. 232).
Du kannst doch nichts dafür.
Cf. “you + couldn't help it, vanyway,
Du kannst jedenfalls nichts dafür.
which in substance is not very different.16


A. Whatever made you pay him?
B. It couldn't be a,voided (p. 185).
Ich konnte doch nicht anders.

A. Trust you to do something silly.
B. I couldn't „help it (p. 255).
Ich kann doch nichts dafür.
Cf. I vreally ,couldn't help it.
Ich kann wirklich nichts dafür.
which is about equivalent.17

The reaction of the speaker to his interlocutor's utterance or attitude can take the form of an imperative. Here the connotation is: ‘By the way you talk (or behave) one would think you didn't know what was the obvious thing to do.’ German doch, which here corresponds to French donc, is chiefly used in utterances which are a reaction to somebody's attitude or behaviour; while eben (Alemannic halt), French eh bien, appears in a rejoinder to an utterance; e.g.


(To a girl standing idly about.)
\Give me a ,hand, Anna (Frisch, 1962, p. 46).
Helfen Sie mir doch, Anna (Frisch, 1959, p. 234).
Aidez-moi donc.

(To somebody laughing at our ignorance.)
You'd better ex‘plain it to me.18
[…] erklär es mir doch (Frisch, 1959, p. 236).
Explique-le moi donc.

(To somebody out of breath and complaining of it.)
Don't be in -such a hurry (Brecht, 1957, p. 62).
Haben Sie doch nicht solche Eile (Brecht, 1956, p. 94).
Ne soyez donc pas si pressé.

A. I can't make a nosegay with one single flower.
B. ,Try and ,find an‘other (Palmer, 1922, p. 74).
Such eben noch eine zweite.
Eh bien, tâche d'en trouver une autre.

16. All the sentences followed by anyway that we have come across in the textbooks have a FR, e.g. […]

17. Kingdon marks F + R in the following way: You needn't do that. There ‘isn't a better one; for ‘at times the rising tone falls on what may be considered to be the most important word in the utterance’ (1958, p. […]).

18. Our own translation. The translator's version is unsatisfactory.




In English this kind of imperative often begins with well, corresponding to French eh bien, or ends with then, meaning in that case. In German it can begin with so, which also means in that case. However, these particles are not indispensable. The German particle eben and the English tone-pattern are the main bearers of the connotation; e.g.

A. The bus doesn't run on Sundays.
B. Come by ‘train \then (O'Connor and Arnold, 1961, p. 125).
So komm eben mit dem Zug.

A. I wish Ann didn't dislike me so.
B. Well don't be so ‘rude to her in future (O'Connor and Arnold, 1961, p. 125).
So sei eben in Zukunft etwas netter zu ihr.19


As with statements, the RF sometimes replaces the F, with a similar connotation; e.g.

A. This apple is not quite ripe.
B. Take an^other one (if you don't like the one you've got) (Kingdon, 1958, p. 231).
Nimm eben einen anderen.

A. I ought to invite her.
B. Well then in^vite her (O'Connor and Arnold, 1961, p. 160).
So lad sie eben ein.
Eh bien invite-la.

A. This pen's useless.
B. Well try a ^different one (O'Connor and Arnold, 1961, p. […]).
Versuch's eben mit einer anderen.

A. So far I haven't had time.
B. !Start “now, then (O'Connor and Arnold, 1961, p. 167).
So beginn eben jetzt.

[…] attitude of somebody pointing to what the interlocutor should have hit […]

Remark. Imperatives with R do not, strictly speaking, belong here. They […] an obvious consequence of the interlocutor's statement but […] sheer protest, like the statements with R mentioned in the remark […] pages 181-2. That is why German eben is inappropriate here; e.g.

A. I'm going to sack him.
B. ,Don't „do „that (O'Connor and Arnold, 1961, p. 191).
Tu das doch nicht.

A. I'm terribly sorry.
B. Don't a,pologize (O'Connor and Arnold, 1961, p. 192).
Entschuldige dich doch nicht.21


With a FR these imperatives, like the corresponding statements, have a […] of plaintive pleading (see O'Connor and Arnold, 1961, p. 249); e.g. […]t ,do „that. Don't let it get you „down (O'Connor and Arnold, 1961, p. 259).


The reaction to the situation here under discussion, though it cannot be a real question, is sometimes cast in the interrogative form. Here the German particle is denn. With special questions the nucleus is a F or RF, with similar connotations as in the corresponding statements and imperatives: what questions correspond to statements, negative why questions in the present tense to imperatives; e.g.


(To somebody bothering us with his personal problems.)
“What's it got to -do with us? (Jassem, 1952, p. 71).
Was geht denn das mich an? = Das geht doch mich nichts an.

(To somebody refusing to believe in a person's identity.)
Who "else can he ‘be? (Frisch, 1962, p. 56).
Wer kann er denn sonst sein? (Frisch, 1959, p. 241).

(To somebody complaining of his tight new shoes.)
Why don't you wear an "old ,pair?

(To somebody who has tried in vain to ring up a friend.)
Why don't you ,write him a “letter?


20. O'Connor and Arnold say of RF imperatives ‘shrugging off responsibility’ (p. 48). Kingdon: ‘impatient commands’ (1958, p. 231).

21. As with statements, O'Connor and Arnold say of imperatives with this pattern: ‘reproving criticism’ (p. 532).


(To somebody perplexed by a predicament.)
Why don't you “do something about it?

In German the imperative is much more appropriate here than the question form. The same holds good with French.

Zieh doch alte Schuhe an. Schreib ihm doch einen Brief.
Tu doch etwas in dieser Sache.
Écris-lui donc une lettre.


Remark. There are many borderline cases, where one is in doubt whether the utterance is a real question or a veiled statement; for real questions, too, can have this intonation, which connotes surprise, often unpleasant surprise; e.g.

A. You must let me in. I've got a season ticket.
B. Why didn't you say so be‘fore (O'Connor and Arnold, 1961, p. 123).
Warum haben Sie das denn nicht gleich gesagt?

„What are you “talking about? Von was redest du denn?

(To somebody who says that his guest does not like wine.)
„How d'you ‘know he ,doesn't like it?
Wieso weißt du denn, daß er ihn nicht gern hat?22


The surprise can also be caused by something or somebody catching our eye. The nucleus falls on the item that causes the surprise.

What are ‘you doing in the ,Park? (Schubiger, 1935, p. 28).
Was tust denn du hier im Park?

What did ‘she want? (Schubiger, 1935, p. 31).
Was wollte denn die?23


Incidentally the last two examples point to another more intellectual function of the German modal particle. When placed before a personal pronoun or similar normally unstressed word it indicates that this word bears a full stress. In the English corresponding sentences the context alone guides the speaker, or reader, in the placement of stresses. In print italics are often made use of, for instance in the above two sentences, both from Galsworthy's Forsyte Saga.

22. Many more examples are to be found in Allen's Exercise 70 (1954, pp. 13-4).
23. O'Connor and Arnold say that with special questions this pattern expresses [...] reaction to something very unexpected, and, for that [...] immediately pleasing to the questioner (p. 40). With low [...] head: ‘somewhat unpleasantly surprised’ (p. 109). Jassem says: ‘surprise and [...] protest’ (1952, p. 71). Kingdon: ‘mystification’ (1958, p. 125).


Here are some more examples: 
Da kann doch ‘ich nichts dafür. 
It's not ‘my fault. 


Was versteht schon ‘die von Politik. 

What does "she understand of ,politics?

Da bin wieder ‘ich nicht auf der Höhe (Eliot, 1962, p. 715).
That's not ‘my line of action, you know (Eliot, 1873, p. 186).

Aber wie kann denn ‘ich Schmuck tragen, wenn du als die Ältere keinen tragen willst? (Eliot, 1962, p. 20).
But how can ‘I wear "ornaments, if “you, who are the elder sister, will ‘never ‘wear them? (Eliot, 1873, pt. I, p. 13).


General questions, which in substance amount to statements, have [...] nucleus; e.g.

A. Stop grumbling about it.
B. Would „you -like your -garden «trampled over? (O'Connor and Arnold, 1961, p. 181).
Wärst denn du beglückt, wenn man auf deinen Blumen [...]

(To somebody worried about the Jones's opinion.)
Does it matter what they "think?
Ist's denn so wichtig, was sie von dir denken?
Es ist doch egal, was sie denken.

(To somebody wrongly blaming me.)
Is it my fault that you have failed?
Bin denn ich an deinem Mißerfolg schuld?

(To somebody exhibiting unmotivated surprise.)
Is it so very sur,prising? (O'Connor and Arnold, 1961, p. 51).
Ist es denn so erstaunlich?


The German particles being so very numerous and differentiated, while the tone patterns of English, at least in so far as they can conveniently be described, are less so, it is not surprising that [...] than doch correspond to the tone patterns treated in the preceding [...]. The difference of meaning, where not suggested by the contents and [...] context, is in English rendered by voice quality. Here are some examples:

24. O'Connor and Arnold say that this pattern almost invariably expresses disapproval or scepticism (1961, p. 51).
+ See footnote 2. 




1(a) ‘Come in. Kommen Sie doch herein.
*Go ,on. Fahren Sie doch weiter.
*Cheer ‚up. Nimm's doch nicht so zu Herzen.
Here the adversative connotation is still felt to be present.

(b) “Come ‚in. Kommen Sie nur herein.
*Go ‚on. Fahren Sie nur weiter.
There is no adversative element here. The tone is encouraging.


2(a) “I've said -nothing «wrong. Ich hab doch nichts Dummes gesagt.
Adversative element still present.

(b) “That's a sur-prise for "you. Das ist mal (or aber) eine Überraschung für Sie.
Here's a thing you *don't see too -often.
Das ist mal etwas, das man nicht jeden Tag sieht.
The adversative connotation has faded; both mal and the English tone pattern are solely expressive of emotion.


3(a) A. Why didn't you tell me?
B. You didn't “ask me.
Du hast mich doch gar nicht gefragt.
Adversative element still felt to be present.

(b1) A. There'll be about ten I suppose.
B. There'll be ^more (O'Connor and Arnold, 1961, p. 157).
Es werden sogar noch mehr sein.

(b2) A. Why didn't you inform me of this?
B. It was strictly confidential. I didn't tell my “husband.
Ich hab's nicht einmal meinem Mann gesagt.
Here the RF can have a concessive connotation. With b1 more could be replaced by even more. With b2 not stands for not even.

(c) A. You seem to know this part of the country very well.
B. I've “lived here for a long time.
Ich wohn auch schon lange hier.
Auch has here a causal meaning, which in English is suggested by the RF.


4(a) Does it matter what they think?
Ist's denn nicht egal, was sie denken?
Es ist doch egal, was sie denken.
The question amounts to a statement.




(b) (Doctor to patient) Have you been ‘smoking again?
Haben Sie etwa (zufällig) wieder geraucht?
Auriez-vous (par hasard) recommencé à fumer?

Am I 'late?
Bin ich etwa verspätet?
Serais-je en retard?

(To somebody fumbling for his key.)
Have you for'gotten it?
Hast du ihn etwa vergessen?

A. We had a meeting last night.
B. Should ‘I have -been there? (O'Connor and Arnold, 1961, p. 219).
Hätt ich etwa dabei sein sollen?

A. Don't be so cut up about it.
B. Were ‘you pleased?
Warst du etwa erfreut?


Etwa expresses a suspicion that the truthful answer must be yes (or no). In English there is often a high rise, i.e. a glide from a medium to a high pitch.²⁷ The suspicion may be a virtual certainty. That is also why some of the rhetorical general questions on page 189 could in German have etwa instead of denn.

Of all the German unstressed modal particles none, probably, has a greater variety of connotations than doch. We have here confined ourselves to one of its functions. In conclusion we will cast a brief glance at some other doch-sentences and their English counterparts.²⁸ With some current ones [...] of syntax, e.g. the doch which makes a statement into a request for confirmation; e.g. Du hast doch die Fenster geschlossen? You have ‘shut the windows, haven't you? Doch nicht etwa with the [...] tense expresses apprehension; e.g. Er wird doch nicht etwa in den Teich gefallen sein. ‘Don't tell me he ‘fell into the pond.’

The purely emotive, exclamatory doch has its counterpart in an English tone pattern with great pitch differences, either wide glides or wide jumps; e.g.

27. O'Connor and Arnold say that a high rise in general questions is light and casual (1961, p. 210). Jassem says that the high rise ‘refers to a situation in which the listener is involved’ (1952, p. 78). Also German etwa zufällig, French par hasard indicate that the speaker wants to make his question sound casual.
28. Schneider attributes to doch a variety of connotations, ranging from force of rhetoric to self-complacent assertion, and (in imperatives) anxious pleading or entreaty (1959, pp. 275-7). For a detailed study of the English locutional equivalents of German doch see Pestalozzi-Schärli (n.d.).




Du bist doch ein rechter Esel. "You „are an ass.
Das ist doch lächerlich. "That's ridiculous.
Es war doch wunderschön. "Wasn't it „wonderful?
Das ist doch ein prächtiges Buch.
"This is a „lovely book; or: "Isn't this a „lovely book?


29. It is interesting to note that even purely emotive doch has retained a [...] ‘as you know’: it is only appropriate in exclamatory sentences concerning something the interlocutor has had a share in.
30. Here a locutional and an elocutional means of expression are [...] English: the interrogative form, pointing, like doch, to a shared experience, and an emotional tone pattern. An exclamatory sentence about something new to the interlocutor cannot be in the interrogative form, nor can it in German comprise doch; e.g. You should have joined us; it was wonderful. Du hättest mitkommen sollen; es war wunderbar.


References 


ALLEN, W. S. (1954), Living English Speech, Longman.
ARNDT, W. (1960), ‘Modal particles in Russian and German’, Word, vol. 16, pp. 323-36.
BOLINGER, D. L. (1961), Generality, Gradience and the All-or-None, Mouton.
BRECHT, B. (1956), Der gute Mensch von Sezuan, Suhrkamp.
BRECHT, B. (1957), The Good Woman of Setzuan, trans. E. Bentley and M. Apelman, Grove Press.
CATFORD, J. C. (1964), ‘Phonation types: the classification of some laryngeal components of speech production’, In Honour of Daniel Jones, Longman, pp. 26-37.
COLLINSON, W. E. (1938), ‘Some German particles and their English equivalents. A study in the technique of conversation’, German Studies presented to Prof. H. G. Fiedler, Oxford University Press, pp. 106-24.
COLLINSON, W. E. (1954), The German Language Today: Its Pattern and Historical Background, Hutchinson.
COUSTENOBLE, H. N., and ARMSTRONG, L. E. (1934), Studies in French Intonation, Heffer.
DÜRRENMATT, F. (1956), Der Besu[...]
[...]osition, edited by H. F. Eggeling and K. Wildhagen, [...]
ELIOT, George (1873), Middlemarch, in four vols., Blackwood.
ELIOT, George (1962), Middlemarch, trans. Ilse Leisi, Manesse.
FRISCH, M. (1959), Biedermann und die Brandstifter, Spectaculum II, Suhrkamp.
FRISCH, M. (1962), The Fire Raisers. Three plays by M. Frisch, trans. M. Bullock, Methuen.
GÜTTINGER, F. (1963), Zielsprache, Theorie und Technik des Übersetzens, Manesse.
[...], ‘[...] tones of English’, Archivum Linguisticum [...]
HULTZÉN, L. S. (1959), ‘Information points in intonation’, Phonetica, vol. 4, pp. 107-20.
HULTZÉN, L. S. (1962), ‘Significant and non-significant intonation’, Proceedings of the Third International Congress of Phonetic Sciences, Mouton, pp. 658-661.
JASSEM, W. (1952), Intonation of Conversational English, Sklad Glowny: Dom [...]
KINGDON, R. (1958), The Groundwork of English Intonation, Longman.
LEE, W. R. (1956), ‘Rise-fall intonation in English’, English Studies, vol. 37, pp. 62-72.
MONFRIES, Helen (1963), Oral Drills in Sentence Patterns, Macmillan.
O'CONNOR, J. D., and ARNOLD, G. F. (1961), Intonation of Colloquial English, Longman.
PALMER, H. E. (1922), English Intonation. With Systematic Exercises, Heffer.
PESTALOZZI-SCHÄRLI, Annemarie (n.d.), Die Wiedergabe des unbetonten doch im Englischen, Diss.
PIKE, K. L. (1945), The Intonation of American English, University of Michigan Press.
SCHNEIDER, W. (1959), Stilistische Deutsche Grammatik, Herder.
SCHUBIGER, Maria (1935), The Role of Intonation in Spoken English, Cambridge [...]
SCHUBIGER, Maria (1958), English Intonation, Its Form and Function, Niemeyer.
SHAW, G. B. (1925), Die Heilige Johanna, trans. S. Trebitsch, Fischer.
SHAW, G. B. (1936), Saint Joan, Tauchnitz.
TRAGER, G. L. (1958), ‘Paralanguage’, Studies in Linguistics, vol. 13, pp. 1-12.




10 Richard Gunter

Intonation and Relevance

A paper written specially for this volume.


Sentences are fairly easy to isolate in samples of human speech, and that fact leads linguists to make a great, unspoken assumption: that the sentence can be adequately treated in isolation. This assumption may be a convenience, but it is seriously misleading, for it tends to obscure important linguistic facts and relations.


A sentence usually occurs among other sentences; it is, in fact, usually connected to them in some way. A sentence is most closely connected to its context sentence, which is often the one just preceding. It is useful to say that a sentence is a response to its context, and is relevant to that context. These notions can be illustrated with the following two-line dialogue:


A (Context): Where is John? 
B (Response): 2 He's in the 3 HOUSE 1 ↓


In this dialogue, Sentence A is [...]

But an even more powerful argument is this: a context sentence acts as a floodlight upon the response, revealing details about that response, and clarifying its structure and meaning. The investigator who removes a sentence from its context shuts off that light; thus he may obscure the very facts that he is trying to understand. Some illustrations will show what is meant.

If we take an utterance like 3 JOHN 1 ↓, we cannot discern much about its structure or meaning. But the moment we make it relevant to a context, the structure and meaning leap into focus, as in the following:

Context: Who is in the house?
Response: 3 JOHN 1 ↓

Instantly the observer sees that this response is elliptical, and that it has the underlying structure 3 JOHN is in the house 1 ↓. It is the context that allows this interpretation. But the very same phonetic sequence 3 JOHN 1 ↓, if taken in a different context, is revealed to have a completely different structure and meaning, as in the following:

Context: Who did they see?
Response: 3 JOHN 1 ↓

The full form of this response is 2 They saw 3 JOHN 1 ↓, a sentence in which the sequence 3 JOHN 1 ↓ is now the object. Again, it is the context that illuminates the structure and meaning of this response. Thus two examples of the utterance 3 JOHN 1 ↓ appear to be identical if taken in isolation, but different contexts allow us to see them as fundamentally different; indeed, those contexts compel us to do so.

Pronouns furnish another striking example of the [...] of interpretation in light of context. If we take a sentence like 2 He's in the 3 BARN 1 ↓ we can tell little about the meaning of the subject [except] that its referent must be singular and male; but the moment we put this sentence in a dialogue as a response, we get further information:

Context: Is John in the house?
Response: 2 He's in the 3 BARN 1 ↓

1. [...] S → r ... s → R, in which [...] response, so that one person may stimulate another person to perform the practical response [...]. [...] explicit is the extended exchange of speech, which we might write as follows: [...] etc. Such language [...] of two fundamentally different sorts: on the one hand there may be no [...] the foregoing, as in the dual monologue of two actors on a stage, in which each speaks alternately, but neither is addressing the other; on the other hand there is true dialogue, in which each speech is relevant to the one that [...]. The linguistic study of dialogue is the study of the devices that indicate relevance.


Now we know that He means John, and that piece of information is important; indeed, in a real conversation it might be crucial. Our usual way of talking about pronouns is all oriented to the speaker's point of view, the point of view of production. That is what leads us to say that a pronoun is a ‘substitute’ for a noun. But in understanding dialogue the point of view is quite different. The hearer must take a pronoun like He as a context signal that tells him something like this: Go back to the last singular, male noun in the context; that noun will be the meaning of this pronoun.²
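
The hearer's rule just quoted can be sketched as a small procedure. This is only an illustration under stated assumptions: the mini-lexicon, the feature labels and the function name `resolve_pronoun` are hypothetical and are not part of the paper, and the rule is deliberately the naive one stated in the text.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Noun:
    form: str
    number: str  # "singular" or "plural"
    gender: str  # "male", "female" or "neuter"


# Hypothetical mini-lexicon, just large enough for the dialogues in the text.
LEXICON = {
    "John": Noun("John", "singular", "male"),
    "Bill": Noun("Bill", "singular", "male"),
    "house": Noun("house", "singular", "neuter"),
}

# Features that each pronoun tells the hearer to look for (assumed labels).
PRONOUN_FEATURES = {
    "he": ("singular", "male"),
    "she": ("singular", "female"),
    "it": ("singular", "neuter"),
}


def resolve_pronoun(pronoun: str, context: str) -> Optional[str]:
    """Go back to the last noun in the context that matches the pronoun."""
    wanted = PRONOUN_FEATURES[pronoun.lower()]
    words = [w.strip(".,?!") for w in context.split()]
    for word in reversed(words):
        noun = LEXICON.get(word)
        if noun is not None and (noun.number, noun.gender) == wanted:
            return noun.form
    return None


print(resolve_pronoun("He", "Is John in the house?"))
print(resolve_pronoun("he", "John hit Bill."))
```

For the context Is John in the house? the sketch returns John, as the text describes. For John hit Bill. it returns Bill, the last singular male noun, which shows how quickly the simple rule runs out: the author's own footnote on this point warns that the rules for anaphora are much more complicated.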

Context may have an important bearing even upon the lexical meaning of an utterance. For example, if we isolate such an elliptical form as 3 WAN 1 ↓ it is ambiguous, but a context may reveal with great precision the lexical meaning of this utterance, as in the following:


Context: Do you mean they lost or won? 
Response: 3 WAN 1 ↓

Or again:

Context: How many fish did you catch?
Response: 3 WAN 1 ↓


Clearly it is the context in each of these dialogues that refines the meaning of the ambiguous sequence 3 WAN 1 ↓.

For a final example of the way a context may floodlight the response, we may take such a sentence as They are flying 3 PLANES 1 ↓. In isolation this sentence may be ambiguous, but it becomes quite clear in such a context as the following:


2. The rules for unravelling anaphora are really much more complicated than this illustration may seem to argue, as in such a dialogue as the following, for example:
Context: John hit Bill.
Response: Did he hurt him?
[...], and anaphora generally, suggests that the making of language and the understanding of language are quite different kinds of activity. The idea that pronouns are substitu[...]


Context: What are your friends doing? 
Response: 2 They are flying 3 PLANES 1 ↓


Or in a different context: 


Context: What are those things? 
Response: 2 They are flying 3 PLANES 1 ↓


The moment that They is identified as animate or inanimate through context, the structure of flying 3 PLANES 1 ↓ also becomes clear. In fact, there seems to be no way to understand flying 3 PLANES 1 ↓ except to determine first the meaning of They.

Thus to inspect responses in their contexts reveals features of those responses that are not visible in isolated sentences. One of the features of English sentences that can be illuminated in this way is intonation.
The analysis of intonation that is widely known as the Trager-Smith system has served an important function: it has invited testing, and thus has stimulated linguists to gather new and interesting data for that purpose. But there is now widespread doubt that the system is adequate to organize the observations that it has made possible. The analysis is inadequate, or unsatisfying, in at least three ways. First, there is ground for doubt that the stuff of intonations is a set of discrete phonemic pitch levels and terminal junctures. Second, even granting that the analysis into phonemes is correct in principle, there are not enough entities in the Trager-Smith inventory to go round, for there are intonations that cannot be written in the system at all. Third, the Trager-Smith analysis has never allowed us to see clearly what it is that intonation does in [...] it means. One may take an intonation in the abstract, divorced from words, and may try to assign a meaning to it, but that task is baffling. It is [...] connections between an intonation and the internal semantic or grammatical facts of the sentence with which that intonation occurs. Such connections are at best elusive, and it may be that they do not exist at all. There are sentences that can take many different intonations; there are intonations that can occur with all sorts of sentences; and, most telling of all, there is no string of words that has a necessary intonation.

[It seems] that we cannot confront the first two objections to the Trager-Smith system until we have somehow met the third, for until we know what intonation means we cannot say whether two intonations are phonemically the same or different; we cannot know what kinds of primitives intonations are built of, or what number of primitives there are, until we have understood which intonations mean the same and which mean different things. The most important question before us, then, is this: What is the role of intonation in English?⁴

3. The word ambiguity is itself sometimes ambiguous in linguistic discussion, for [...]. First, there is the notion that an [...] constituent structures; second, [...] in the first sense may be fai[...] sense is not very common except [...] speculate on; for context usually makes it clear w[...] structures [...] is meant in an expression like flying [...]. [...] book came to my hands after this paper was [...]

This paper, though it falls far short of a complete answer to that question, does present conclusions about the role of simple intonations in one area of usage: the two-line dialogue, examples of which we have already seen above. These dialogues are made up of simple sentences like John drank tea and John is in the house and The play was wonderful. Every simple sentence of these three kinds has many forms besides the affirmative statement: each has question forms, elliptical forms, the negatives of these, and others. Several of these forms will be used in the illustrative dialogues to follow.

Simple intonations will be used with the responses of these dialogues. A simple intonation is one that accents a single syllable, as in 3 JOHN 1 ↓ or 3 NObody is in the house 1 ↓. A contour begins with the accented syllable and continues to the end of the sentence; if there are syllables before the accent, as in 2 The play was 3 WONderful 1 ↓, their intonation will be called a precontour. These definitions exclude any intonation like 3 JOHN 1 ↓ 2 is in the 3 HOUSE 1 ↓, which accents two syllables, and is therefore not simple.

In all of the dialogues to follow, the first speech is a context, and the second is a response that is relevant to that context. The focus of attention is the response, for certain claims are to be made about the intonations that occur with responses. The claims to be supported are these:

1. The welter of different simple intonations that are possible in this usage can be reduced to a small number of significant sets of contours and a non-significant set of precontours.

2. These contours are not made up of discrete, phonemic pitch levels and terminal junctures.

3. In this usage an intonation means nothing in itself, nor is it dictated by the internal semantic or grammatical facts of the sentence with which it occurs. It may signal something about the emotional state of the speaker, but such ‘expression’ is a minor, unstable part of the intonation's meaning; the stable, testable meaning of an intonation in the present usage is the manner in which that intonation connects the response to the context.

4. [...] indebted to: Pike (1945); Trager and [...]; Stockwell (1960; 1962). I am grateful t[...]


The simplest possible English intonations are those that occur with monosyllabic utterances like John. In these intonations one is dealing only with accent and contour. The contours are short, since they cover only one syllable, and there are no precontours at all, since there are no syllables before the accent. Some of these intonations are given below in the Trager-Smith symbology. For reasons that will become clear in a moment, the intonations are arranged here in three sets. Each intonation should be imagined as spoken with a monosyllabic utterance like John:⁵


A        B        C
42 ↓
41 ↓     44 ↑     43 ↑
31 ↓     33 ↑     4[?] ↑
21 ↓     [?]      32 ↑
[?]      22 ↑     31 ↑
[...]    [...]    [...]


5. Trager [...] not, of course, use the terminal arrows in their original [...]; the terminal junctures were [...]. [...] will notice that nowhere in the present paper is there any use of [...] reference to the entity that it purports to represent. In fact, this writer does not believe that there is [...], at least that is significant [...]. Suppose, for illustration, that we have a dialogue such as [...] with a response [...] Trager-Smith symbols might be written with a level arrow:

Context: Who is in the barn?
Response: 2 John 2 →

The claim is that such an intonation would be interpreted either as high-rising or as low [...]; that is, it would be assigned the meaning Could it be John? or the meaning It is John.

This representation deals in discrete elements of pitch and juncture. These elements are ‘phonemes’, with all the dogma and doctrine that the word implies. Thus the implication is present that each intonation is absolutely different from every other. For example, 41 ↓ and 31 ↓ are just as different from each other in [...] as either is from, say, 33 ↑ or 32 ↑. But the behavior of these intonations in dialogue is distinctly against this implication; for within one of the sets all the intonations behave alike. This fact should not be surprising, for all of the members of a given set closely resemble each other in that they share a gross shape: the members of A are grossly falling; those of B are grossly high-rising; those of C are grossly falling-rising. These gross shapes can be schematized with reference to a base line as follows:


[Figure: schematic curves of the three gross shapes (falling, high-rising, falling-rising), each drawn against a base line.]


Thus each set of intonations can be regarded as a contour with a recog- 
nizable shape, and each member of a set can be regarded as a variant of 
that contour. In a given dialogue, moreover, all of the variants within a 
contour signal exactly the same relevance, as in the following: 

Context: Who is in the house? 
Response: 3 JOHN 1 ↓ (Relevance: Answer to information question.)


This relevance remains intact with any variant of the falling contour, whether 41 ↓, 31 ↓ or 21 ↓. To be sure, each of these variants may seem to have its own flavor in this dialogue, but that flavor is emotional or expressive. In a given case it can be paraphrased with This is all so dull or You're silly not to know that or something of the kind. This emotional flavor is not very stable, however; as we shall see later it depends as much upon non-linguistic facts as upon the exact tonetic details of the intonation. What is important about these falling variants is that they all have the same gross shape. All signal the same relevance here; they all answer the question.


Turning to the high-rising contour and its variants, commutation again 
shows that all variants signal the same relevance, as in the following: 
Context: John is in the house. 

Response: 3 JOHN 3 ↑ (Relevance: Reclamation.)⁶

Once again, each variant of the high-rising may have its own expressive overtones in this dialogue, and these overtones may range from faint surprise to incredulity, but they do not affect the relevance, which with [...]

6. This paraphrase of the relevance at hand is one of the briefest; most such paraphrases are cumbersome, as in Gunter (1966). In the present paper I use simple glosses for each relevance, or merely names for kinds of relevance, such as reclamation [...] (1957).


Exactly the same kind of analysis can be made of the falling-rising contour and its variants in such a dialogue as the following:

Context: Nobody is in the house.
Response: 3 JOHN 1 ↑ (Relevance: You forget John.)

Yet again, commutation of the variants of the falling-rising contour over this response does not affect relevance: the variants 43 ↑, 32 ↑, 31 ↑ and so on all signal the same relevance here, though each variant may signal its own expressive overtones.

Thus the behavior of simple intonations in such dialogues argues that intonational curves of a given gross shape mean the same, whatever their emotional coloring. To state the same conclusion in a different way, these curves are not made up of phonemic pitch levels and terminals as [such]. The important thing is the assignability of a given variant to the proper contour, of which it is merely one manifestation. At a stroke, a rather great reduction is made in the number of significantly different intonations in this usage; accordingly, a single writing can be used for each of the three contours. Hereinafter the writing 31 ↓ will be used for the falling contour, 33 ↑ for the high-rising, and 31 ↑ for the falling-rising.⁷ Each of these writings stands for any of a large, but unknown number of variants that fall within each contour.
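
The reduction of many variants to three writings can be sketched as a small classifier. What follows is a hedged illustration, not the author's procedure: the function names and the string format ("31↓", with or without spaces) are hypothetical, and the decision rule is an assumption inferred from the variants named in the text (a terminal fall gives the falling contour; a terminal rise gives falling-rising if the pitch level drops before it, and high-rising otherwise).

```python
import re

FALLING = "falling"
HIGH_RISING = "high-rising"
FALLING_RISING = "falling-rising"

# The single writing adopted in the text for each of the three contours.
CANONICAL = {FALLING: "31↓", HIGH_RISING: "33↑", FALLING_RISING: "31↑"}


def gross_shape(writing: str) -> str:
    """Assign a two-level writing such as '41↓' to a gross contour shape."""
    match = re.fullmatch(r"\s*([1-4])\s*([1-4])\s*([↓↑])\s*", writing)
    if match is None:
        raise ValueError(f"expected two pitch levels and an arrow: {writing!r}")
    start, end, terminal = int(match[1]), int(match[2]), match[3]
    if terminal == "↓":
        return FALLING
    # Terminal rise: a drop in level before the rise gives falling-rising.
    return FALLING_RISING if start > end else HIGH_RISING


def canonical_writing(writing: str) -> str:
    """Replace any variant by the one writing that stands for its contour."""
    return CANONICAL[gross_shape(writing)]


for variant in ("41↓", "21↓", "33↑", "43↑", "32↑"):
    print(variant, "->", gross_shape(variant), "->", canonical_writing(variant))
```

The point of the sketch is only that the classification looks at gross shape and ignores the exact levels, which is the sense in which the pitch levels are claimed to be non-significant.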
There is another aspect of this reduction that holds surprises for the experimenter who examines it for the first time. The two dialogues below will show what is meant:

Dialogue A
Context: What did he drink?
Response: 3 TEA 1 ↓ (Relevance: Answer to information question.)

Dialogue B
Context: John drank wine.
Response: 3 TEA 1 ↓ (Relevance: Contradiction.)

In the responses of both these dialogues the segmentals are the same, and both responses have the falling intonation. Now as has already been argued, either response can be rendered with any variant of the falling contour, and it will fulfill the indicated relevance: answer to information question in A and contradiction in B. What is surprising is that one can render the first response with some falling variant, then switch that response bodily to the second dialogue, and it will then fulfill the indicated relevance in the second dialogue: contradiction. The experimenter may sense a rather abrupt change in expression as the switch is made, but nothing will be lacking in the indication of relevance. Any falling variant will signal either of the two relevances; therefore, either response can be rendered with any variant of the falling, then switched to the other dialogue, and it will there fulfill the relevance indicated for that dialogue.

7. The difficulty of teaching a pitch-level scheme to classes of students first led me to [think]ing that one is dealing with contours instead of discrete pitch levels. Students [...] grasp the contours quickly, but have endless difficulty marking the levels. There is some experimental evidence that practiced linguists have the same trouble (see Lieberman, 1965).
But context appears to have a powerful sway over our feelings about 
the very tonetic facts of a response contour; indeed, context seems to 
govern our very perception of those facts to a surprising degree. Perhaps 
the two dialogues below will make clear what is meant: 


Dialogue C 
Context: Nobody is in the house. 
Response: 3 JOHN 1 ↑ (Relevance: You forget John.)


Dialogue D 
Context: John is in the house. 
Response: 3 JOHN 1 ↑ (Relevance: Surely you don't mean John.)


Here again both responses have the same segmental phonemes; and both employ the same contour. Any variant of that falling-rising contour will fulfill the indicated relevance in either dialogue; and again, one can render the response of one dialogue with a given variant of the falling-rising contour, then switch it to the second, and the relevance indicated for the second will be perfectly preserved.
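
The switch experiment can be restated as a lookup in which relevance depends on the context and the gross contour, never on the particular variant. The sketch below is illustrative only: the table simply transcribes the glosses given for Dialogues A to D, while the function names and the three-character string format for a writing are assumptions made for the example.

```python
def contour(writing: str) -> str:
    """Gross shape of a writing like '31↓': two pitch levels and an arrow."""
    start, end, terminal = int(writing[0]), int(writing[1]), writing[2]
    if terminal == "↓":
        return "falling"
    return "falling-rising" if start > end else "high-rising"


# Glosses taken from Dialogues A-D, keyed by (context, contour of response).
RELEVANCE = {
    ("What did he drink?", "falling"): "Answer to information question.",
    ("John drank wine.", "falling"): "Contradiction.",
    ("Nobody is in the house.", "falling-rising"): "You forget John.",
    ("John is in the house.", "falling-rising"): "Surely you don't mean John.",
}


def relevance(context: str, writing: str) -> str:
    """Relevance signalled by a response variant in a given context."""
    return RELEVANCE[(context, contour(writing))]


def switch(writing: str, first_context: str, second_context: str):
    """Carry one unchanged rendition from one dialogue to another."""
    return relevance(first_context, writing), relevance(second_context, writing)


# The same variant fulfills a different relevance in each dialogue.
print(switch("21↑", "Nobody is in the house.", "John is in the house."))
# Every falling variant fulfills the same relevance in Dialogue A.
print({relevance("What did he drink?", v) for v in ("41↓", "31↓", "21↓")})
```

Nothing in the sketch models the expressive overtones, which the text treats as unstable and as separate from relevance.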

The experimenters who attempt switch for the first time often express surprise and unwillingness to believe the assertions that have been made here. But they invariably accept those assertions after further practice with such experiments. Their doubts take two forms. The first is that the experimenter [...] finds it difficult to believe that the variant in question [...] the same, even when he himself is the utterer, and tries [...] variant the same during the switch. The [second concerns] the assertion that [...] truly fulfills [...] it is switched, or that it ‘sounds natural’. Doubters are quite ready to believe [...] performers are rendering the contexts and responses, but are not so ready when they themselves render the switched responses. For example, experimenters called upon to switch 3 JOHN 1 ↑ from Dialogue C to Dialogue D above will often feel the impulse to exaggerate the rise and fall of the variant as it is switched to D. They seem to feel dissatisfied with, say, the Trager-Smith 21 ↑ as a response in D; they feel impelled to make it something like the T-S 41 ↑. Yet curiously they are much readier to accept the 21 ↑ when it is rendered by someone else, especially if the response is accompanied by some gesture that seems to make it appropriate.⁸
The cause of these odd effects is hard to pin down, but perhaps it is this: As we have seen, an isolated sentence may not have a clear meaning or purpose in itself, but takes on an essence when supplied with a context to which it is relevant. The two become an organic whole, in which the response is suffused with the meaning of that whole. When we tear a response away from its context we cannot fully rid ourselves of the memory of that meaning with which it was suffused; but when others are performing contexts and responses, the onlooker is not so intimately involved in the matter, and tends to accept what he hears as perfectly normal exchange of speech. In short, the surprise that switch occasions in us has little to do with the exact tonetic facts of responses, but lies rather in the changing context, which frustrates our expectations and reveals to us that our perceptions are partly illusion tinctured with memory.
Switch bolsters the notion that relevance is a kind of constant, a more
‘linguistic’ notion than is expressiveness. We can divorce the two, and
thus simplify the data that intonation presents, a desideratum that has
long haunted the dreams of scholars. But to put expression aside and to
proceed without it by no means allows us to say that expressiveness is
unimportant, for it is important; it is all-pervasive in real conversation,
and is thus a proper object of linguistic study. Also, it is complex in the
extreme, seeming to defy every generalization that the investigator puts
forward. One may think that he has found a stable expressive meaning for
some variant, only to find upon further reflection that the finding
crumbles. For example, suppose that the response of Dialogue A above is
rendered with some low-pitched variant of the falling contour, say,
T-S 2 1 ↓. Such a rendition seems colorless in that dialogue, in no way
unpleasant. Then the investigator may go on to note that when this
response is switched to Dialogue B it suddenly seems to connote a smug
assurance. Thus the investigator may think that he has discovered a law:
2 1 ↓ in Dialogue A is colorless, but in B is smug. But the generalization
crumbles when the investigator realizes that he has been assuming a
gesture all the time. It is true that 2 1 ↓ may sound smug in Dialogue B;
but imagine that it is rendered by someone who is tactfully correcting an
error and the very opposite effect comes through.

8. The assertions made here about commutation, switch and the fulfilling of
relevance rest upon two sorts of experimental procedure. The first is the
monitoring of one's own performance: in studying switch, for example, one
renders a dialogue aloud, then, holding in memory exactly the rendition of
the response, the experimenter switches it to another dialogue. The second
procedure involves at least three performers, A, B and C. A memorizes an
intonational variant; B renders a context and A responds with his memorized
item; then C renders a second context and A again responds with his
memorized item. An onlooker, taking no active part in these renditions,
stands aside and judges. Experiments with tape splices could be made, but I
have not been able to carry out such experiments for want of the proper
equipment.

The divorcement of relevance from expressiveness does not give us a
ready way to handle expressiveness, which remains refractory and elusive
in the extreme, influenced as it is by gesture and by the situation in which
speech takes place; the divorcement merely allows us to proceed with the
study of the way in which intonation marks relevance, a study that
permits conclusions in which we can feel more confidence.

There are, then, two sorts of argument for the unity of all variants of a
given contour: commutation of variants over some response in a dialogue
leaves the relevance unchanged; and switch bolsters the notion that all
variants within a contour are indeed the same. Thus are established three
significantly different contours in the usage at hand.

But there is a fourth intonation that disturbs the harmony of this system.
This is the low-rising contour, which can be written 1 1 ↑. This contour
presents several problems. First, though it seems most to resemble the
high-rising variants, it contrasts with them in the marking of relevance,
as in the following dialogue:


Context: Who is in the house?
Responses:
(High-Rising) 3 JOHN 3 ↑ (Relevance: Could it be John?)
(Low-Rising) 1 JOHN 1 ↑ (Relevance: Answer to information question.)

The relevance to context marked by these two contours is clearly
different in the two responses; the low-rising contour is therefore not to be
grouped with the high-rising variants. Generally, the hearer need only
perceive that a variant h[…]

[…], moreover, indicates that the low-rising variants […] common with the
falling variants, for in case after case low-rising and falling mark the
same relevance in such dialogues as these:

Context: What did John drink?
Responses: 3 TEA 1 ↓ (Relevance: Answer to information question.)
1 TEA 1 ↑ (Relevance: Answer to information question.)




Or in a different context:

Context: Is John in the house?
Responses: 3 YES 1 ↓ (Relevance: Answer to yes-no question.)
1 YES 1 ↑ (Relevance: Answer to yes-no question.)

Thus the low-rising variants seem to serve as alternate forms of the
falling contour in case after case. It is tempting, then, to subsume the
low-rising variants under the falling contour, but two considerations
militate against this: First, there is the stubborn fact that the low-rising
is simply not shaped like the falling contour; second, there is the fact that
the relevance-marking functions of the low-rising variants are rather sharply
limited to the answering of questions; there are many functions that falling
variants have that low-rising variants do not have.⁹ One of these functions is
recapitulation of the context, or of some part of the context. This relevance
is a kind of assent to the proposition that the context makes. In the
following dialogue, the falling variants fulfill this relevance in a natural
way, but the low-rising variants sound odd or impossible:

Context: John drank tea.
Responses: 3 TEA 1 ↓ (Relevance: Recapitulation.)
1 TEA 1 ↑ (Relevance: ???)


In the following dialogue the falling variants fulfil the contradiction
relevance, but the low-rising variants do not:

Context: John drank tea.
Responses: 3 WINE 1 ↓ (Relevance: Contradiction.)
1 WINE 1 ↑ (Relevance: ???)¹⁰

All of these facts considered, it seems best to set the low-rising up as a
separate contour, equal with the others but eccentric in its distribution.

9. Dwight Bolinger in a personal communication suggests that the low-rising
contour may sometimes mark questions, in which cases it fails to contrast
with the high-rising contour in the relevance marked, as in the following:

Context: Johnson wants higher taxes.
Response: 1 SO 1 ↑
3 SO 3 ↑

It may be that there are many such places in various kinds of dialogue where
two contours mark the same relevance. In the cases I have isolated where
contrast seems to […], I have usually had great difficulty in coming to a
firm conclusion.

10. There seem to be very few variants of the low-rising contour excepting
those that differ in length. Perhaps the reason is that there simply isn't
much pitch ‘space’ for the low-rising variants to wander in, since the lowest
of the high-risings are just above the highest of the low-risings.


We have to do, then, with four contours that can occur over monosyllables
in the usage under discussion. To summarize what has been said
about them: Each of the intonations can be assigned to one of four gross
shapes, or to state the matter differently, there are four contours, each
with several variants; in a given dialogue a particular variant may have
some distinctive expressive function, but expressiveness is elusive and
unstable from dialogue to dialogue, and even appears to change in a
particular dialogue when there are changes in the real surrounding situation
or in the attitude of the speaker; linguistic meaning lies only in the shape
of the contour, and that meaning is the way the contour connects the
response to the context; these facts allow us to see that intonations are not
made up of discrete phonemes of pitch and juncture. Finally, although the
low-rising contour is somewhat eccentric in its functions, all of the
contours can occur with any monosyllable in at least some function;
moreover, any intonation that occurs with any monosyllable can be assigned
to one of the four contours.
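
The assignment of variants to contours summarized above can be illustrated with a small sketch. It is only an illustration under stated assumptions: the `Variant` record, the function name `contour_of` and the decision rules are hypothetical, chosen so that the variants cited in this paper (for example 3 1 ↓, 3 3 ↑, 3 1 ↑ and 1 1 ↑) fall under the contours to which the text assigns them. The paper itself states no such rules, and terminal direction is written here with arrows.

```python
from dataclasses import dataclass

# Terminal directions (assumed notation: an arrow after the last pitch level).
FALL, RISE = "↓", "↑"


@dataclass(frozen=True)
class Variant:
    start: int     # pitch level on the accented syllable (1 = lowest, 4 = highest)
    end: int       # pitch level reached just before the terminal
    terminal: str  # FALL or RISE


def contour_of(v: Variant) -> str:
    """Assign a variant to one of four contours by gross shape only.

    These rules are an illustrative assumption, not a procedure from the paper.
    """
    if v.terminal == FALL:
        return "falling"
    if v.end < v.start:
        return "falling-rising"   # drops, then turns upward at the terminal
    if v.start == 1:
        return "low-rising"       # rise that begins on the lowest level
    return "high-rising"


for label, variant in [
    ("3 1 ↓", Variant(3, 1, FALL)),
    ("2 1 ↓", Variant(2, 1, FALL)),
    ("3 3 ↑", Variant(3, 3, RISE)),
    ("3 1 ↑", Variant(3, 1, RISE)),
    ("1 1 ↑", Variant(1, 1, RISE)),
]:
    print(label, "->", contour_of(variant))
```

Because only the gross shape is inspected, 2 1 ↓, 3 1 ↓ and 4 1 ↓ all come out as falling, which mirrors the claim that pitch stratum distinguishes variants but not contours.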


The four contours likewise apply universally to polysyllabic responses.
This contention rests upon an assumption, to be sure, but that assumption
is everywhere made in the literature on English intonation; study of the
present usage, moreover, turns up no evidence against it. The assumption
is this: longer contours like those on the right below can be equated with
the shorter ones on the left below, which we have already dealt with over
monosyllables:

                   Shorter variants   Longer variants
(Falling)          3 JOHN 1 ↓         3 JOHN is in the house 1 ↓
(High-Rising)      3 JOHN 3 ↑         3 JOHN is in the house 3 ↑
(Falling-Rising)   3 JOHN 1 ↑         3 JOHN is in the house 1 ↑
(Low-Rising)       1 JOHN 1 ↑         1 JOHN is in the house 1 ↑

A contour, on this assumption, remains the same no matter whether it covers
a single syllable or is stretched out to cover […]. […] assumption:

Context: Who is in the house?
Responses: 3 JOHN 1 ↓ (Relevance: Answer to information question.)
3 JOHN is 1 ↓ (Relevance: Answer to information question.)

Or again:
Context: John is in the barn.
Responses: 3 JOHN 3 ↑ (Relevance: Reclamation.)
3 JOHN is 3 ↑ (Relevance: Reclamation.)
3 JOHN is in the barn 3 ↑ (Relevance: Reclamation.)

Longer and shorter variants mark the same relevance in these dialogues.
This fact argues that the length of a contour has no meaning, but is merely
an automatic concomitant of the number of syllables that the contour covers.
Longer and shorter variants are thus to be equally subsumed under the
contour of which they are manifestations.

A contour thus embraces much variety, yet a priori it seems clear that all
the manifestations of a contour must remain within some limits; at least a
given manifestation must be assignable to one of the four contours without
error. Yet it is clear from a detailed study of contours and their variants
that the latter are sometimes rather […]. We have already looked at a few of
the variants of, for example, the high-rising contour:

4 4 ↑
3 3 ↑
2 2 ↑
These variants look like strata, each pitched at its own level. In the same
way we can visualize the bundle of falling variants.

But there are other variants of contours that seem to differ from each
other not merely in pitch stratum, but in the manner in which they proceed
from syllable to syllable. One of these variants of the falling contour,
for example, is a smooth descent from the peak to the bottom of the fall,
as in the following:

Context: How was the play?
Response: [pitch diagram: WONderful, smooth descent]

Another variant might be called the humped descent;¹² sometimes the
hump comes in an unaccented syllable directly after the accent:

Response: [pitch diagram: WONderful, hump on the syllable after the accent]

11. My Indiana University dissertation (1963) offers some experimental
support for the assumption that informants expand shorter contours to longer
ones, at the same time that they expand and supply constituents to fill out
elliptical forms.

12. This intonation came to my notice through Stockwell (1962). Stockwell
attributes an earlier discussion of it to James Sledd. I have not seen the
Sledd work.


Sometimes the hump is included in the accent syllable:
Response: [pitch diagram: WONderful, hump within the accent syllable]

All of these variants are clearly manifestations of the falling contour, 
for they all signal the same relevance in this dialogue, and they do so in 
other dialogues without exception. It is possible, in fact, to take a list of 
expressions that have the falling contour, render them with the smooth 
descent, and then translate them into a humped descent, as in the dia- 
logue below, where it is clear that either the smooth or humped descent 
fulfills the relevance Answer to information question. 


Context: Who is in the house?

Responses:
Smooth descent    Humped descent
[pitch diagrams: responses beginning JOHN and BOB, each rendered once with a
smooth descent and once with a humped descent]

It is even possible to translate a smoothly-falling monosyllable into the
corresponding humped descent:
[pitch diagram]
The other contours also exhibit variants that differ in the manner of
their procedure from syllable to syllable. For example, the high-rising
contour has one variant that begins high and continues as a monotone to
the end, where there is, at most, a barely perceptible further uprise at the
very end; a second variant ascends stairstep fashion syllable by syllable
to the very end, as follows:

Context: The play was wonderful.
Responses:
Stairstep rise    Monotone
[pitch diagrams: WONderful rendered with a stairstep rise and as a high
monotone]

These variants also seem to be fully interchangeable in any dialogue
where either can serve to mark relevance. Yet both always signal the same
thing, as both here signal the relevance Is this what you said? The same
can be said for other variants of particular contours, so that ordinary
variants that differ from each other in their pitch strata are equal in
function to those that differ in their manner of proceeding from point to
point.

But we have not quite done with variety in contours, for it is obvious
that any variant of any contour must be actualized within the limits
imposed by the segmental phonology of the material embraced by the
intonation. (Segmental phonology must be taken here to include not only
the consonant-vowel sequence, together with the distribution of stress
and plus juncture, but syllable count as well.) Some aspects of the
segmental material are necessarily registered in the manifestation of the
contour. Consider merely that if the material contains a voiceless stop,
that fact will be registered by a break at that point in the intonation,
however brief it may be; and if there are two syllables in the material, or
twenty, that fact will also be registered in the intonation. Given these
considerations, it seems inescapable that no two contours can ever be
the same unless the segmental material that they cover is also identical.

When the accent of a simple intonation occurs on any syllable of the
response after the first, the syllables before the accent bear a precontour,
as in the sentences below, where the precontour is marked simply by the
numeral 2. The precontour is to be understood as extending from that 2
to the accent:

2 The play was 3 WONderful 1 ↓
2 The play 3 WAS wonderful 1 ↓
2 The 3 PLAY was wonderful 1 ↓
3 THE play was wonderful 1 ↓

13. The humming of intonation is a good pedagogical aid in the classroom; it
is also useful in the analysis of one's own performance of intonation,
especially when there is a question whether two intonations are alike or
not. Humming has the virtue that it strips away all the consonants and
vowels except m; also (as far as I can tell) it strips away […]. Humming
preserves only the number of syllables, their length, volume and pitch. One
can fill in a hummed intonation with several actual sentences, that is, […]
a stringing of words in some grammatical construction to the hummed
intonation.


Precontours may thus be rather long, rather short, or non-existent; also 
they vary greatly in other ways. We have already seen some of the ways 
in which the variants of contours differ, but they are at least under the 
constraint that they must somehow stay within a range, for every variant 
must be assignable to its contour without error. But precontours are not 
under even this constraint, for they have nothing to do with the marking 
of relevance; consequently, they exist in many different patterns. A few of 
these are sketched in the following paragraphs. 
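
The marking convention of the example sentences above (pitch numerals written as separate tokens, the accented word printed with capitals) can be made concrete with a short sketch that separates a marked response into precontour and contour. The function name `split_precontour` and the rule that an accented word begins with two capital letters are assumptions made for this illustration only; the paper gives no mechanical procedure, and a one-letter accented word would not be detected.

```python
from typing import List, Tuple


def split_precontour(marked: str) -> Tuple[List[str], List[str]]:
    """Split a marked response into (precontour words, contour words)."""
    # Keep only tokens containing a letter; pitch numerals and the
    # terminal mark are dropped.
    words = [t for t in marked.split() if any(ch.isalpha() for ch in t)]
    for i, word in enumerate(words):
        # Assumed convention: the accented word starts with two capitals.
        if len(word) >= 2 and word[:2].isupper():
            return words[:i], words[i:]
    raise ValueError("no accented word found in: " + marked)


for response in [
    "2 The play was 3 WONderful 1 ↓",
    "2 The play 3 WAS wonderful 1 ↓",
    "2 The 3 PLAY was wonderful 1 ↓",
    "3 THE play was wonderful 1 ↓",
]:
    precontour, contour = split_precontour(response)
    print(len(precontour), precontour, contour)
```

The four responses give precontours of three, two, one and zero words, which is the sense in which a precontour may be rather long, rather short, or non-existent.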

First, the precontour may be level, a monotone on one or another
pitch plateau. Below are three of these plateaus that seem to this writer
possible and natural:

Context: Where is John?
Responses:
(low-level) John is in the HOUSE
(mid-level) John is in the HOUSE
(high-level) John is in the HOUSE
[pitch diagrams: in each response HOUSE carries the falling contour, while
the precontour John is in the lies on a low, mid or high plateau]


Another kind of precontour is the stairstep up, as in the following
response:¹⁴

Context: Where is John?
Response: John is in the HOUSE
[pitch diagram: each word of the precontour John is in the steps up toward
HOUSE, which carries the falling contour]

Such stairsteps up can take many forms, depending directly upon the
[…] material covered; that is, if the precontour is long, with many
syllables, correspondingly greater variety is possible in the steps up. But
even with the response above at least two other varieties of stairsteps are
possible:

John is in the HOUSE
John is in the HOUSE
[pitch diagrams: two further stairstep precontours before HOUSE]

There are many other sorts of pattern in precontours: there are stairsteps
down, metrical effects, and much else, for precontours seem to wander far
more extravagantly than do the variants of contours. The researcher who
attempts to study precontours exhaustively can be so […] with their variety
that he begins to doubt that they can be […] to any system at all. But it is
clear that precontours have nothing to do with the marking of relevance,
which can be studied quite apart from the disconcerting variety of
precontours.

There is, however, one kind of restraint upon variety in precontours, a […]
we have already seen in the variants of contours: in some way the precontour
must reflect some of the facts of the phonological material that it covers.
It must break momentarily at a voiceless stop, and it must register the
number of syllables that it embraces. Given such facts, it may be that
ultimately we shall be able to find order in precontours by making […] that
there is a limited number of patterns that precontours fall into, such as
stairsteps, monotones and the like; and these patterns enjoy many sorts of
actualization that depend upon the phonological material enclosed. For
example, there may be only a small number of stairstep patterns, and the
precise forms that these take would seem to be directly dependent upon the
number of syllables available for the making of steps. Clearly, for
instance, there can be only one step up if the precontour covers only two
syllables. But it is plain that at the moment we do not know enough to make
final pronouncements about the number and kinds of precontours.

14. Stairsteps up are sometimes too rich in levels to be marked in the
restricted inventory of Trager-Smith symbols, a fact pointed out by James
Sledd in his review of Kingdon (1960).


This paper may have seemed to argue that two procedures are always
easy to carry out:

1. The investigator can always decide whether an intonation is simple or
not.

2. If an intonation is found to be simple, the investigator can always
decide whether it consists of a contour alone, which can be identified as
one of the four established, or whether it is one of those contours plus
a precontour, the two being easily and neatly separable in such a way that
the investigator can always assign the relevance-marking function to the
contour alone.


In fact these procedures are easy to carry out in most cases, but in some
cases they are not easy. It now seems proper to bring forward one of these
difficult cases. This is the grossly falling-rising intonation that is said to
be favored by some Englishmen for use with yes-no questions. This
intonation also occurs among Americans with information questions, as
follows:

[pitch diagram]

The same intonation sometimes occurs with statements, as follows:

[pitch diagram]

In isolation this intonation is difficult to mark. It gives rise to a series of
questions: Is this intonation simple? If so, is the accent on house, so that the
contour is low-rising and so that John is in the is all under a precontour
that descends in the manner of stairsteps? Or is the accent on John, so that
the entire intonation is a manifestation of the falling-rising contour?

15. This example is taken from Gleason (1964, p. 22). This item begins a
dialogue, so that it has no overt context. Students often render it with the
intonation under discussion, and the class then usually have difficulty
marking it. The arguments that ensue are settled by professorial fiat, a
kind of authority that must […] arguments over the past fifteen years.


Taken in isolation, examples of this intonation do not easily yield
answers to these questions. The intonation is sometimes refractory even in
the relevance relation to a context, especially in that relevance that can be
called denial, in which the context is affirmative and the response negative,
as in Dialogue A below, or the context is negative and the response
affirmative, as in Dialogue B:

Dialogue A
Context: John's in the house.

Dialogue B
Context: John isn't in the house.
Response: [pitch diagram]

It is possible to render such responses so that they are unambiguous; but
the point is that many actual occurrences of such responses are not easy to
mark, for able students of intonation will disagree about them. Points at
which scholars have difficulty in marking intonation deserve careful study,
for the difficulties surely are of some theoretical importance. It may be, in
fact, that the assignment of accent in the responses above is […], that is,
uncalled for by the language system; or perhaps the significance of
accent placement is ‘neutralized’ in the denial relevance.

But considerable experimentation indicates that an ambiguous response
of this kind, if switched to certain other contexts, is no longer ambiguous
as to accent placement, the placement being revealed by the floodlight
effect of that new context. The investigator may, for example, switch the
ambiguous response of Dialogue B above to Dialogue C below, where he
will then hear the accent on John.


Dialogue C
Context: Who is in the house?
Response: [pitch diagram: John is in the house, in the ambiguous rendition]

But if switched to a different dialogue, the identical response may,
through the same kind of trompe l'oreille, appear to present the accent on
house:

Dialogue D
Context: Where is John?
Response: [pitch diagram: the identical rendition of John is in the house]


A similar, and even more astonishing fact is this: the experimenter can
render Dialogue C so that the accent is quite clearly on John in the
response; then he can snip off the last three words of that response, in the
house, switch those words and their intonation to Dialogue D, and will find
that such a response in D will fulfill the relevance Answer to information
question, and will sound quite genuine. These and other such oddities
heighten the suspicion that context governs a great part of our perception
of intonation, in which there seem to be few perceptual absolutes. Thus
when a sentence is made in isolation we may have difficulty in deciding
what the tonetic facts are; sometimes, as we have seen with the denial
relevance, an intonation may remain ambiguous even when it does have
a context.


Nor do difficulties end there in the study of English intonation, for there
are usages, sentence types and complex intonations that have not even
been touched in these pages. Scholars who have dwelt with these problems
know that they are not all going to be solved tomorrow with a single flash
of understanding. Indeed, it may be that English intonation requires not a
single kind of explanation but several kinds; for it seems likely that
intonation plays not a single role but many roles.¹⁶

But in the usage marked out for treatment in this paper, the role of
intonation is clear: that role is the marking of relevance. Intonations that
play this role must be studied over sentences in context, for only thus can
we tell what significantly different intonations there are; only thus can we
tell what they mean.

There are, in fact, four intonational contours that mark relevance. The
four apply universally to English monosyllables; any intonation that occurs
with a monosyllable in this usage can be assigned to one of these contours.
The contours can be stretched to embrace several syllables, and may even
have precontours before them. But the details of precontours, and even
the details of the contours themselves, signal nothing but the emotional
stance of the speaker. Thus a variant is not made up of discrete pitches and
junctures, but merely has a gross shape which permits it to be assigned to its
proper contour. In the usage at hand that contour is not a product of the
internal facts of the sentence with which it figures, but is a context signal
that binds the response to the context. Such context signals make dialogue
possible.

16. Some forms, such as commands, often have no overt contexts; thus it seems
unlikely that their intonations play the relevance role. Furthermore, it is
prudent to remember that no sentence can be said without an intonation; it
may be, therefore, that there are intonations that serve no purpose at all.


References

BLOOMFIELD, L. (1933), Language, Holt, Rinehart & Winston.
BOLINGER, D. (1957), Interrogative Structure of American English, Publications of the
American Dialect Society, vol. 28.
BOLINGER, D. (1961), ‘Contrastive accent and contrastive stress’, Language,
vol. 37, pp. 83-96.
GLEASON, H. A. (1964), Workbook in Descriptive Linguistics, Holt, Rinehart &
Winston.
GUNTER, R. (1963), Elliptical Forms of the English Transitive Sentence, Indiana
University microfilm.
GUNTER, R. (1966), ‘On the placement of accent in dialogue: a feature of context
grammar’, J. Ling., vol. 2, pp. 159-79.
LEES, R. B., and KLIMA, E. S. (1963), ‘Rules for English pronominalization’,
Language, vol. 39, pp. 17-28.
LIEBERMAN, P. (1965), ‘On the acoustic basis of the perception of intonation by
linguists’, Word, vol. 21, pp. 40-54.
LIEBERMAN, P. (1967), Intonation, Perception and Language, Cambridge
University Press.
PIKE, K. L. (1945), The Intonation of American English, University of Michigan
Press.
SLEDD, J. (1960), Review of Kingdon, Language, vol. 36, pp. 173-8[…].
STOCKWELL, R. (1960), ‘The place of intonation in a generative grammar’,
Language, vol. 36, pp. 360-67.
STOCKWELL, R. (1962), ‘On the ana[…]’, […] Conference on Problems of
Linguistic Analysis in English, pp. 39-55.
TRAGER, G. L., and SMITH, H. L. (1951), An Outline of English Structure,
Battenburg.


11 František Daneš

Order of Elements and Sentence Intonation

František Daneš, ‘Order of elements and sentence intonation’, To Honour
Roman Jakobson: Essays on the Occasion of his Seventieth Birthday, Mouton,
1967, pp. 499-512.


In his paper ‘Some universals of grammar’ (1963), J. H. Greenberg¹
introduces the notion of DOMINANT ORDER of syntactic elements and explains
(p. 76) that the ‘dominance is not based on its more frequent occurrence’
(a dominant order is not that alternative which is more frequent than its
opposite, the ‘recessive’ order) but on the fact that the dominant order
can always occur while its opposite is present only under specified
conditions, i.e. in co-occurrence with another, ‘harmonic’, construction.
These conditions are stated in terms of grammatical notions, such as verb,
object, pronominal object, etc.


[…] very aptly shows that […] ‘dominant order’ are […] ‘Lenin cites Marx’ may
[…] other word order: SVO […], […]et, VSO Citiruet Lenin […], […] Marksa Lenin
citiruet, OVS […] the six logically possible […] according to Greenberg,
‘do not […]’, namely VOS, OSV, OVS. […] (selection) of the different
variants is regulate[d] […] two concepts on the basis of […] School
linguistics, […]


[…] for various intra-linguistic functions. Second, the intra-linguistic
functions employ a set of systemic devices, and there is, in a given
language, […]unique mapping of the set of devices into the set of
functions; this means that to each function a subset of complementary devices
is […], and vice versa.

Any utterance (i.e. every sentence taken as a unit of discourse or
[…] (K. Hausenblas, 1964)) may be analysed (or represented), within the
[…], on three different levels (cf. Daneš, 1964). The respective
levels are: (1) the level of grammatical structure; (2) the level of semantic
structure; (3) the level of thematic and contextual organization of utterance.

e.g.
       John   bought   a book
(1)    S      V        O
(2)    Ag     Ac       G
(3)    T      C

Explanations: S = Subject, V = Verbal Predicate, O = Object; Ag = Agent,
Ac = Action, G = Goal; T = Topic, C = Comment.²

In languages like English the order of elements in the underlying patterns
corresponds, as a rule, to the order of the respective words in the
corresponding utterance. (Or, in other words, the fixed order of elements
belongs to the constituent or even distinctive features of the pattern.) But
there are languages in which the order of elements in the patterns is not
necessarily fixed; it is not employed, or is only partially employed, as a
grammatical device. This is often neglected especially by those scholars who
base their general scheme of linguistic description on English, as it has
been aptly shown by Dean S. Worth (1964). Let us now consider the order of
elements on different syntactic levels in some detail.

1.2. On the grammatical level the rules of order are of three types: (1)
functional rules; (2) concomitant rules; (3) weak rules. In each group the
position of a sentence element is determined by its syntactic function, but
in a different way.

In cases where the opposition between two syntactic categories is
implemented (realized) by two different positions of the element in the
sentence pattern (the order being thus a distinctive feature), the
corresponding rules may be called ‘functional rules’ and the order of
elements may be termed ‘grammaticalized’ (e.g. in English the pattern S-V-O).

2. The terms ‘topic’ and ‘comment’ have been introduced by Yuen Ren Chao
(1959) and correspond to the terms ‘theme’ and ‘rheme’, used by some Czech
scholars as English equivalents for V. Mathesius's terms ‘základ (téma)’ and
‘jádro’. C. F. Hockett considers the ‘bipartite structure’ of a common clause
as a language universal and calls […], too, ‘topic’ and ‘comment’
(cf. Hockett, 1963).

On the other hand, in some instances the position of an element is ‘fixed’, 
and yet the violation of the rule fixing its position in the sentence does not 
lead to a different sentence (with other grammatical relations between the 
elements); the result will only be an ‘ungrammatical’ or ‘less grammatical’ 
form of the original sentence. The position of the elements in the sentence is 
then only a concomitant (‘redundant’, not distinctive) feature of their 
syntactic function. Such features do not belong to the system of the given 
language, but to its norm (the latter being, according to E. Coseriu, a
commonly accepted, habitual, and traditional realization of the former).³
Example: dependent genitive case follows its dominating noun in many
European languages.


In the third case, a certain order of elements is ‘usual’; any deviation
from this order, permitted by the ‘weak’ rule and motivated by special
non-grammatical conditions, is associated with the feature of
‘non-neutrality’ or ‘markedness’. The possibility of ‘inversion’ is common,
e.g. in some Slavic languages with the attributive adjective (the usual word
order being there AN).

The fundamental distinction between these three types of grammatical
word order, pointed out by Mathesius, has, unfortunately, been neglected
by Greenberg;⁴ thus, the latter scholar classes both English and Slavonic
into the same common group II (SVO; cf. o.c., 87f.).

In languages with the so-called ‘free’ word order, we must consider a
fourth possibility, i.e. a ‘labile’ order. In this case, the order of some 
elements of the pattern on the grammatical level is irrelevant; in utterances 
based on such a pattern, the position of the respective words vacillates 
according to non-grammatical conditions.

Languages may differ in the particular set of ordering rules (and in their
distribution in different syntactic patterns), as well as in the functional loads
of the rules.


1.3. It is obvious that in the above quoted Russian example (Lenin citiruet 
Marksa and its variants) the underlying grammatical pattern S-V-O 
contains neither a functional nor a concomitant fixed order. The variations 
of the word order may be due to a usual, or even to a labile, order. It seems
as if the ‘neutrality (unmarkedness)’ of the variant Lenin citiruet Marksa
were based on the fact that the underlying pattern S-V-O shows the usual

3. The significance of redundant features in linguistic structure was very aptly
pointed out by R. Jakobson (cf. e.g., Jakobson, 1956). Such features are not operative
as ‘distinctors’ but as ‘identifiers’ (cf. also Coseriu, 1952).

4. Cf. also some remarks of the present author (1965) and his earlier paper (1959).


218 Intonation and Grammar 


order of elements. In other words, the unmarkedness of the given variant
would follow from the agreement (correspondence) of the actual sequence 
of the particular words in the utterance with the order of the respective 
elements in the underlying pattern, while the other variants would be 
experienced as marked in consequence of the disagreement of both orders. 


Schematically:

                      Lenin citiruet Marksa    Lenin Marksa citiruet
Actual sequence       S   V   O                S   O   V
Grammatical pattern   S → V → O                S → V → O

(The sign → shows the usual order of elements in the pattern.)


But a deeper inspection shows that the matter is not so simple. Let us
consider the following Russian examples: (a) Rebjata kupalis’ (S V) and
(b) Nastala vesna (V S), both of which must be intuitively and empirically
considered as unmarked (the marked variants being Kupalis’ rebjata and
Vesna nastala, respectively). At the same time it is clear that both sentences
are based on one and the same underlying grammatical sentence pattern;
as both possible orders may (under certain, for the moment unspecified,
circumstances) lead to an unmarked utterance, we must conclude that the
order of elements in the underlying pattern is free (labile).

1.3.1. It remains to explain the conditions under which some sentences of
this type are neutral (unmarked), and vice versa. If the explanation in
grammatical terms has failed it must be sought for on the semantic level.


Let us suppose that there are two semantic types of verbs, labelled, for
this moment, X and Y, respectively, and that the verb kupalis’ ‘they-
bathed’ belongs to the type X, and the other, nastala ‘she-came’, to the
type Y. According to our basic model of the three syntactic levels, we
assume, in this case, two different semantic patterns underlying our
sentences (a), (b), namely: (a) Ag → X; (b) Y → B,5 with the usual order of
elements. The respective tabular arrangement may be as follows:

(a)                      Rebjata kupalis’ (unmarked)    Kupalis’ rebjata (marked)
grammatical  act. s.:    S V                            V S
level        pattern:    S V (labile order)             V S (labile order)
semantic     act. s.:    Ag X                           X Ag
level        pattern:    Ag → X (usual order)           Ag → X (usual order)


5. B stands for a correlative term to Ag in the domain of Y. 




(b)                      Nastala vesna (unmarked)       Vesna nastala (marked)
grammatical  act. s.:    V S                            S V
level        pattern:    V S (labile order)             S V (labile order)
semantic     act. s.:    Y B                            B Y
level        pattern:    Y → B (usual order)            Y → B (usual order)


The matrices show that the marked (non-neutral) variants of the sentences 
are those which reveal a disagreement between the lines of actual sequence 
and of semantic patterns. 

The significance of the semantic level for word order has not yet been 
sufficiently recognized and investigated. Nevertheless, the pioneering 
work of some Czech scholars (Firbas, 1953; 1961; 1962; 1964a; Adamec, 
1963; 1966; Beneš, 1962; Novak, 1959; Pala, 1966; Uhlířová, 1966), 
devoted to the contrastive analysis of English, Russian, and German with 
Czech, as well as some suggestions of other linguists (esp. Hatcher, 
1956a, 1956b and Worth, 1964) have proved very promising. Thus, 
e.g. our semantic verbal category Y might be described as denoting 
‘existence’ or ‘bringing into existence (or upon the stage)’ and its relevance
to word order may be traced in different languages. Ascertaining relevant 
semantic categories and sentence patterns in various languages is one of 
the most important and interesting tasks of structural analysis. 

The semantic category Y is associated with additional linguistic means,
the order of elements being only one of them. E.g. in English, where the
order of elements is more grammatical than in Russian, this category will,


" TUNE cases, be accompanied by a special construction, namely of the 

SE 5 i +++; €. There were some new pictures on the walls. 
etess, this construction involves an ‘inversion’ 

e ег as 

well (cf. Greenberg's а рес 


"harmonic construction’): i 
( ; inst have 
here “there + VS”. It follows that Sg 


The significance of the semantic structure of the sentence for the order
of elements may be clearly shown on sentences rendering man’s inner
states. The person who is the ‘recipient (R)’7 of sensations,
of states, is expressed differently in different languages, by




means of different syntactic patterns (cf. the following examples); never-
theless, in all our examples the phrase expressing R stands at the beginning
of the sentence:

Russian: U Ivana bolit golova (‘With Ivan aches head’)
         Adv.    V     S
Czech:   Ivana bolí hlava
         O     V    S
English: Ivan has headache
         S    V   O


1.3.2. But it would be a false assumption to think that for every sentence
there must necessarily exist a neutral (unmarked) form. E.g. whereas the
Czech sentence Zvoní telefon (VS, ‘The telephone rings’) is experienced,
in contradistinction to its variant Telefon zvoní (S V), as fully neutral, its
negative counterpart Nezvoní telefon and the variant Telefon nezvoní are
felt as marked (although in different ways). The explanation should be
looked for in the marked character of negation and in the different semantic
value (position) of negative statements as opposed to positive ones.

On the other hand, there are some semantic types of sentences that may
occur with a single order of elements only, even in languages with the
so-called ‘free’ word order. Thus, in Czech sentences of the type Lev je
šelma (‘Lion is a beast’), i.e. in utterances that denote the placing of an
individual into a class, the subject should precede the predicate.

1.4. So far we have analysed sentences in isolation, i.e. taken out of con-
text and situation, as abstract structures. But these structures would be
employed as concrete utterances in different contexts and situations. We
should even admit that our above judgements concerning the neutrality of
the particular sentences may have been sometimes more or less uncertain,
influenced by possible contexts and/or situations. (The most ‘uncontex-
tual’ sentences seem to be the generalized statements, such as ‘Lions are
beasts’, ‘Man is mortal’, ‘Dogs hate cats’, etc.)
1.4.1. If we try to explain the different variants of the above-quoted
Russian sentence ‘Lenin citiruet Marksa’, we come to the conclusion that
the variations are motivated by their contextual (and situational) depen-
dence and applicability (even the neutral variant clearly presupposes a
certain context, or, more precisely, a certain class of contexts). In other
words: every utterance points to a ‘consituation’ (to use Mirowicz’s
term).


Analysing the structure of utterance from this point of view, we state its
bipartite pattern. The respective two parts may be defined from two different
aspects. (a) Taking for granted that in the act of communication
every utterance is, in principle, an enunciation or statement about something,
we shall call the respective parts ‘topic’ or ‘theme’ (something that
one is talking about) and ‘comment’ or ‘rheme’ (what one says about it). 
(b) Following the other line, linking up the utterance with the consituation, 
we recognize that, as a rule, the topic contains ‘old’ or ‘already known’ 
elements, while the comment conveys the ‘new piece of information’. 
Professor Vilém Mathesius, who elaborated these ideas (under the heading 
of the ‘actual sentence bipartition’ or ‘functional sentence perspective’),
pointed out that as a rule, both aspects (the ‘thematic’ and ‘contextual’) 
coincide so that in most cases it is not necessary to differentiate between 
them. (In our rather sketchy account we shall use the terms ‘Topic (T)’ 
and ‘Comment (C)’.) 


1.4.2. The significance of this bipartition for the order of sentence con-
stituents varies. ‘Typically in Chinese, Japanese, Korean, English and
many others, one first mentions something that one is going to talk about
and then says something about it. In other languages, the most typical
arrangement is for the Comment, or part of it, to precede the Topic . . .’
(see Hockett, 1963). Of course, such a general statement necessarily im-
plies much simplification. It was Mathesius who, as early as at the be-
ginning of this century, pointed out the fact that the order of sentence
elements in every language is governed by a set of different factors (or,
from another point of view, is operative in a set of different functions),
the T-C bipartition being one of them only. The linguist has to ascertain
the hierarchical ordering of these factors for different languages. Some
decades later, Firbas drew attention to the fact that, on the other hand, the
T-C bipartition employs a number of other linguistic means (the order
being one of them only), namely lexical and grammatical devices (such as
particles, articles, constructions) (Firbas, 1964b; 1966). Mathesius’s line of
thinking was followed by a number of Czech, Russian and other linguists
in the analyses of different languages.


1.4.3. It is evident that in languages with the so-called ‘free’ order (i.e. in
languages where the order of elements on the grammatical level is less
grammaticalized and/or fixed), the order of the sentence components
in concrete utterances may be employed for the purpose of the T-C
bipartition. This is the case, e.g. of Russian. How does this fit in with
our model of the neutral (unmarked) order? It is easy to recognize that it is
exactly the contextual bipartition that motivates the non-neutral (marked)
order. Thus, in the Russian utterance Lenin Marksa citiruet we get the
following matrix:

8. СЕ, at least, his article (1929). Some i 
S . Russi үт? 
(smyslovoe) členenie’ (В. A. Il'ji$, К. G. Kagepe PP ешеш RE 




Lenin Marksa citiruet

Syntactic      Actual      Underlying    Order     Agreement     Result
levels         sequence    pattern                 of 2 and 3
1              2           3             4         5             6
grammatical    S  O  V     S  V  O       labile    0
semantic       Ag G  Ac    Ag Ac G       usual     −             marked
T-C            T → C       T → C         usual     +

From the matrix we learn that the usual order on the semantic level has
been changed under the impact of the functional needs of the contextual
T-C bipartition. The resulting utterance links up fully with the given context
(and in this sense it is not marked, but normal), and yet, if valued from the
point of view of the linguistic system (i.e. as found in an isolated sentence),
its word order will be experienced as marked.

1.4.4. Thus, by means of our matrix we can define a ‘neutral sentence’ as
one which has no ‘minus’ sign in column 5. The linguistic interpretation of
this statement is, clearly, that the functional needs of the T-C bipartition
do not, in the case of the ‘neutral sentence’, lead to the change (inversion)
of a usual order.

Thus the explanation of the notion of ‘neutral order’ ought to be
sought in the interaction (interrelations) of patterns on the three different
levels. In patterns, some of their elements (or all of them) may be
bound, by a strong or by a weak rule. If, in the matrix of an
utterance, all three levels are ‘in agreement’ (coordinated), such an
utterance has a neutral word order. (The necessary condition is, of course,
that at least one pattern contains a bound element. In utterances that do
not fulfill this condition, the distinction between the neutral and the marked
order is irrelevant.)

The notion of ‘marked word order’, on the other hand, implies the
solution of a conflict between levels. The solution involves the existence of a
hierarchy of levels and of some specific linguistic devices. There are two
hierarchies: (a) hierarchy of different orders: (1) strong rules (i.e. gram-
maticalized order and fixed order), (2) weak rules (usual order), (3) free
order (labile order); (b) hierarchy of levels: (1) T-C level, (2) semantic
level, (3) grammatical level. The means for solving conflicts are: (a) ‘in-
version’, in the case of weak rules; (b) sentence intonation; (c) particles,
articles, lexical means, specific grammatical constructions; (d) selection
of a different pattern.

9. The components T-C coincide, in different utterances, with different sentence
elements or groups of them. While the former are only two in number (and often
without a precise dividing line), the latter range from one to a theoretically unlimited
number.

1.5. At this moment we must answer two essential questions:

1. What happens when, in a matrix of the above type, a grammaticalized
or fixed order appears?

2. Is the inversion C → T also possible (and if so, what would be the con-
sequences)?

1.5.1. Question (1) may be demonstrated by the English sentence ‘John
hates Mary’. Its matrix contains the grammatical pattern S → V → O
with grammaticalized order. This utterance fits in with a consituation
where the topic of the discourse is ‘John’. But in a different consituation,
where the topic would be ‘Mary’ and the comment ‘the hatred of John
for her’, the order of sentence elements should be changed. Column 2
(actual sequence) in the respective matrix would be:

O  V  S
G  Ac Ag
T → C

But the first (grammatical) line OVS is incompatible with the pattern
S → V → O, showing the grammaticalized order. There are, generally, two
ways of solving this problem: either to relinquish the possibility of render-
ing the T-C bipartition (in fact, there are differences in languages as to their
‘sensitiveness’ for contextual needs), or to use a different, more suitable
grammatical construction. In the case of our English example, we can
make use of the passive construction (in which words or phrases denoting
Ag and G, respectively, are mutually replaced). The passive construction
Mary is hated by John has the following matrix (in contrast to the active
construction, the difference appears on the semantic level):


Passive Active 


(Of course, we do not claim that in English the passive construction is the
only possible solution of all cases. There are other means, such as the defin-
ite and indefinite article, sentence intonation, and others.)

1.5.2. In answering question (2), we shall consider the English sentence
‘John is writing to his father’. If we take it as an answer to the question10
‘What is John doing?’, we should assign the topic value to ‘John’ and the
rest of the sentence would form the comment of the utterance. This is the
normal, neutral case, with T → C order. Now, imagine the same sentence
as an answer to another question, viz. ‘Who is writing to his father?’ In this
case, ‘John’ is felt as comment and the rest of the sentence as the
topic of the utterance. Thus, the order of components would be inverse,
C → T, and the utterance experienced as marked, with emphatic coloring.
The explanation of this shift must be sought for in the fact that in the res-
pective matrix, in the last line of column 5, the sign ‘minus’ appears, due
to the inversion of the usual order T → C.

At this point, the conclusion is not surprising, as it follows from our
definition of ‘neutrality’. But one must ask whether there is any formal
[...] structure of the given [...] should be inferred from the consituation only.
[...] however, avails itself of [...] signalling the com-
ment of the utterance, viz. the sentence intonation. [...]
of great importance and in the next section [...]
the principal regularities between sentence intonation and T-C
bipartition of utterance.
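The matrix procedure of sections 1.4.3 to 1.5.2 can be sketched in a few lines of Python. This is only an illustrative reading of the text, not the author's formalization: the names `Level`, `agreement` and `result` are hypothetical, and the labels ‘incompatible’ (a strong rule is broken, so another device such as the passive is needed) and ‘irrelevant’ (no pattern contains a bound element) are shorthand introduced here for cases that the text describes only in prose.

```python
from dataclasses import dataclass

# Order types of column 4. STRONG covers grammaticalized and fixed order.
LABILE, USUAL, STRONG = "labile", "usual", "strong"


@dataclass(frozen=True)
class Level:
    name: str        # "grammatical", "semantic" or "T-C"
    actual: tuple    # column 2: actual sequence in the utterance
    pattern: tuple   # column 3: order in the underlying pattern
    order: str       # column 4: LABILE, USUAL or STRONG


def agreement(level):
    """Column 5: '0' for a labile order, otherwise '+' or '-'."""
    if level.order == LABILE:
        return "0"   # a free order cannot be violated
    return "+" if level.actual == level.pattern else "-"


def result(levels):
    """Column 6; 'incompatible' and 'irrelevant' are labels assumed here."""
    if all(lv.order == LABILE for lv in levels):
        return "irrelevant"    # no bound element in any pattern
    if any(lv.order == STRONG and agreement(lv) == "-" for lv in levels):
        return "incompatible"  # strong rule broken: use another device
    signs = [agreement(lv) for lv in levels]
    return "marked" if "-" in signs else "neutral"


# The matrix of 'Lenin Marksa citiruet' from section 1.4.3.
lenin_sov = [
    Level("grammatical", ("S", "O", "V"), ("S", "V", "O"), LABILE),
    Level("semantic", ("Ag", "G", "Ac"), ("Ag", "Ac", "G"), USUAL),
    Level("T-C", ("T", "C"), ("T", "C"), USUAL),
]
print([agreement(lv) for lv in lenin_sov], result(lenin_sov))
```

Run on the matrix above, the sketch gives the signs 0, −, + and the result ‘marked’, as in the printed matrix. Feeding it the sequence O V S against a strong pattern S → V → O (the ‘John hates Mary’ case of 1.5.1) yields ‘incompatible’, which corresponds to the point where the text turns to the passive construction.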


2. The functional diapason of sentence intonation (or, more correctly, of the
intonation of utterance), as one of the prosodic features, is very wide and, as
has recently been very aptly pointed out by Crystal and Quirk (1964, p. 12),
the expressions ‘prosodic’ and ‘paralinguistic’ denote ‘a scale which has
at its “most prosodic” end systems of features (for example, intonation
contours) which can fairly easily be integrated with other aspects of
linguistic structure; while at the “most paralinguistic” end there are the
features most obviously remote from the possibility of integration with the
linguistic structure proper . . .’ The systemic functions of intonation
contours are several (Daneš, 1960), one of them being simply to signal the
structure of utterance.

2.1.1. It appears that in many languages (perhaps in most, but we dare not,


10. The questions are used here in order to elicit a proper consituation. 




for the present, make this a universally valid statement), the comment of
the utterance would be associated with the center (nucleus) of the (ter-
minal) intonation contour.11 This means that in languages where, as a rule,
the comment is placed towards the end of the utterance (cf. Hockett’s
semiuniversal), the centre of the terminal intonation contour (CI) should
be located on the last stress-unit of the utterance. E.g.: The train has cóme.
John hates Mary. German: Der Zug ist gekómmen. Russian: Prišel póezd.


2.1.2. But as we have mentioned above, the grammatical rules of the Eng- 
lish word order do not fully allow rearrangement of the sentence con- 
stituents according to the needs of the consituation (i.e. in accordance with 
the T-C structure). Thus, in the sentence There were some pictures on the 
walls, the phrase ‘on the walls’ does not evidently belong to the comment 
(the definite article signals the respective notion as already known); 
consequently, the CI would not be placed on the last stress-unit of the 
utterance, but on the last stress-unit of the comment of the utterance, 
particularly on the word pictures. A contrastive comparison with languages 
having the so-called ‘free’ word order appears to be very illuminating. Let us
consider the following example (cf. Worth, 1964, p. 50):

                       Russian                            English
in consituation (a):   Krovati stojali v jego kómnate     The beds were in his róom
in consituation (b):   V jego komnate stojali krovàti     There were béds in his room


Thus in languages like English, the CI would often be placed on non-
terminal elements of the utterance, while in others, as for instance in Slavic
languages (conspicuously in Czech), the normal suprasegmental (prosodic)
shape of the utterance (i.e. of the non-emphatic utterance in which no word
is emphasized for contrast) is that with the CI in the last stress-unit. The
supposition that the English sentence is only to a small degree sensitive to
the needs of the context turns out to be not fully valid; the differences
between languages lie mainly in the different means that are employed (as
has been ingeniously pointed out by Firbas in connection with the use of
articles).

I will adduce some other examples, in which the comment occupies a
position very remote from the end of the utterance (so that a relatively very
long terminal intonation contour arises): (1) ‘[They went to the river, but
the water was so swift that the monkey was afraid. “Get on my neck”, said
the elephant, “I shall carry you . . .”] I am not afráid to swim across a swift
river’ (Jassem, 1954). (The phrase ‘to swim across a swift river’, although
standing at the end, conveys the notion which has been explicitly men-
tioned in the previous context and consequently belongs to the topic, not to
the comment, of the utterance.) (2) ‘[They’ve been cleaning them up the
whole morning.] I’ve never séen such energy’ (Lee, 1960). (The notion ‘such
energy’ is implied in the preceding utterance.)

11. Some authors call this center ‘the sentence stress’ or ‘the logical stress’.

2.1.3. To sum up: While in Slavonic (especially in Czech) the variability of
word order is compensated for by a rather uniform (automatic) location of
the terminal CI, in English, on the other hand, the highly fixed word order
is compensated for by a great variety of the possible positions of the CI in
the utterance. In other words: in English it is rather the suprasegmental
phonological structure that signals the ‘functional perspective of utter-
ance’, i.e. the points of the highest communicative dynamism. Thus, we
may conclude that the functional load of the two linguistic devices is
different in various languages. (It is worth mentioning that the relatively
high functional load of intonation in English shows itself in relation to
other linguistic functions as well.)
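The placement rule of sections 2.1.1 and 2.1.2 (the CI falls on the last stress-unit of the comment, which may or may not be the last stress-unit of the utterance) can be illustrated with a small Python sketch. The function name `ci_index` and the hand-made division of the examples into stress-units tagged ‘T’ or ‘C’ are assumptions made here for illustration; the text itself gives no segmentation procedure.

```python
def ci_index(units):
    """Return the index of the stress-unit that carries the CI.

    units: list of (text, role) pairs in spoken order, where role is
    "T" (topic) or "C" (comment). The tagging is supplied by hand.
    """
    comment = [i for i, (_, role) in enumerate(units) if role == "C"]
    if not comment:
        raise ValueError("an utterance needs at least one comment unit")
    return comment[-1]  # last stress-unit of the comment


# Consituation (a): topic first, so the CI is utterance-final.
beds_a = [("The beds", "T"), ("were in his room", "C")]
# Consituation (b): English keeps 'in his room' last, so the CI moves left.
beds_b = [("There were beds", "C"), ("in his room", "T")]
# Russian rearranges the words instead and keeps the CI final.
krovati_b = [("V jego komnate", "T"), ("stojali krovati", "C")]

for utterance in (beds_a, beds_b, krovati_b):
    print(utterance[ci_index(utterance)][0])
```

The sketch only restates the rule; it makes visible that English and Russian reach the same result in consituation (b) by different means, a shifted CI in one case and a changed word order in the other.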


2.2. Utterances conveying emphasis are governed by special rules. At least
two classes of emphatic utterances should be distinguished: (1) emphatic
utterances proper; (2) utterances with emphasis for contrast. In class (1)
the emphatic feature characterizes the utterance as a whole, while in class
(2) this feature is associated with one particular element of the utterance
only.

2.2.1. The emphatic utterances proper are characterized by the inverse order
on the contextual level, i.e. by the order C-T, and, consequently, by the
onset position of the CI (this being located on the initial stress-unit of the
utterance) on the suprasegmental phonological level; e.g.: The train has
come! (in contradistinction to ‘normal’ The train has cóme). In Russian:
Póezd prišel! × Poezd prišel; and also (due to the ‘free’ word order)
Prišel poezd! × Prišel póezd. In German: Der Zúg ist gekommen! × Der
Zug ist gekómmen.


trasted with another (either implied 
ed idea’ (Jones, 1956, p. 277). Bolinger has
called such utterances ‘sentences of the second instance’ (1952). Con-
trastive emphasis may be rendered, according to circumstances, by a set
of means: (a) word order, (b) a shift of the CI (it would be displaced from


12. Cf., e.g. the following statement of Halliday, McIntosh and Strevens: ‘. . . it is
important to realize that spoken English makes extensive use of intonation to carry
grammatical meaning’ (1965, p. 53).




its ‘automatic’ (neutral) position), (c) a specific phonological form of the
intonation contour. It is clear that in English the possibilities (b) and es-
pecially (c) are the most common, e.g.: I sent a book to hér (and not to
someone élse), the contrast having been achieved by means of the shift
of the CI. (The normal form would be I sent a bóok to her, according to
the rule that in sentences ending with a preposition and a pronoun, the
final pronouns are not stressed.) In addition to it, the contrast may be
pointed out or modified by means of special intonation contours. Other
examples: [Look out, here comes a càr.] It looks like óur car. What are you
going to do? It was extrémely cold this year. I nearly did forget it (the em-
phasis is foregrounded by means of the construction with ‘do’ as well).
John loves Mary (and not George) = It is Jóhn who loves Mary (with a
special construction). I sàw the man coming along the road (reassuring the
person spoken to; a very long intonation contour). I met her father (and not
her mother or brother . . .); in this case the contrastive emphasis is ren-
dered by means of a specific (emphatic) contour only, as the contrasting
word is in the final position, so that the placement of the CI cannot func-
tion as a distinctive feature.

Many analogous examples might be easily adduced from other languages.
German: Ihr Bruder hat lange auf Sie gewartet. || Ihr Bruder hat lange auf
Sie gewartet. | Ihr Brúder hat lange auf Sie gewartet. (The normal form: Ihr
Bruder hat lange auf Sie gewartet (Essen, 1956).) Some Russian examples
may be found in the book by Buning and Schooneveld (1961, see also
Ebeling, 1958), Czech examples in the book on Czech sentence intonation
by the present author (Daneš, 1957). It is exactly this contrastive emphasis
that is often referred to as ‘logical (sentence) stress’.


2.2.3. In addition, it is possible to use a specific non-terminal intonation
contour for singling out the topic of the utterance; e.g. in English: My
bróther | went to Brighton (implying: ‘as for my brother, he . . .’). If it
succéeds | I shall make . . .; in Russian: Esli tebe grústno . . .; in German:
Geheime Sorgen | sind eine schwere Last.


2.3. It has often been suggested that sentence intonation performs proper
grammatical functions and lexical (semantic) functions as well. Undoubt-
edly, such examples, as:

1(a) I didn’t visit the dòctor | because I was ill.
(b) I didn’t visit the doctor because I was ill (Lee, 1960).
2(a) Please wire | if I am to cóme.
(b) Please wire if I am to come.
3(a) It is the cóuntry | that suits my wife bèst.
(b) It is the country that suits my wife best (Schubiger, 1935)




evidently show the relevance of the suprasegmental phonological structure
for the grammatical and semantic interpretation of such sentences.
Nevertheless, it seems to us that the different grammatical and semantic
interpretations are based on two different underlying T-C structures of the
respective utterances. In other words: rather than saying that the intonation
here works as a grammatical device (distinguishing, e.g. an object clause
from an adverbial one, or determining the function of the conjunction), we
should rather say that this is an accidental effect of two possible T-C
structures of the given utterance. (E.g. in (2a) the ‘if’-clause is marked as
comment, whereas in (2b) it is the verb ‘wire’ that is signalled as an em-
phatic comment, while the ‘if’-clause conveys a thing that is already
known from the consituation.)

An analogous situation may be found in cases where sentence intonation
seems to determine the semantic meaning of a word, e.g.:

4(a) He also visited Prague. (‘He visited some other places and Prague
additionally’)
(b) He álso visited Prague. (‘Others visited it, and he did so as well’)

Fairly analogous examples may be adduced from other languages. But in
all of them we shall find that the two different semantic interpretations of
also (German auch, Russian tože, Czech též, etc.) are determined by the
fact that in (4a) the adverb brings out the subsequent utterance portion as a
new fact (comment), in addition to facts already known, whereas in (4b),
with also bearing the CI, the situation is inverted.

Negative clauses seem to be a field especially favorable for the ‘semantic
intonation’ in many languages. Cf. the following Czech utterances and
their English counterparts:

5(a) S každým nemluví. He does not speak to anybody (with a falling-rising
contour).
(b) S každým nemluví. He does not speak to ànybody (with a falling con-
tour).

The utterances (a) have the [...] while (b) means, in an appropriate [...]
‘he speaks to no one’. It is worth mentioning, however, that the meaning
(a) is [...] with the normal (automatic) form of the [...] signalled by
de-automation, while in English [...] (a) that is associated with a marked
contour). Nevertheless, both languages have another feature in common:
the semantic differences (a) and (b), rendered by intonation, depend, to a
certain extent, on a favorable consituation; the respective intonational
forms alone do not fully ensure that the utterance will be interpreted only
in one of the two


possible meanings, or, in other words, such utterances are not entirely un-
ambiguous. But, at the same time, both languages have a pronoun,
namely nikdo, no one (nobody), which, in contrast to každý, anybody, is
unambiguous, having the negative meaning (a) only (in accordance with
the presence of the explicitly negative morpheme ni- and no, respectively).13


3. It is self-evident that any systemic linguistic description, whether genera- 
tive or not, has to take into account all relevant facts about the most 
elementary and common linguistic devices, viz. the order of elements and 
the utterance intonation (cf. Halliday, McIntosh and Strevens, 1965, p. 73). 
And it seems to me that the generative scheme suggested by the MIT 
group is, in its recent form, scarcely able to account for them in a satis- 
factory way, mainly because its phonological component does not include 
the sentence intonation; besides, the position of the T-C organization, the 
systemic character of which is hardly to be denied, is yet to be stated. (Its 
considerable stylistic import does not, in essence, contradict the un-
deniable appurtenance of the T-C principle to la langue.)14


13. Cf. Vachek (1947). Klima, however, does not mention these facts in his paper
(1964). A more detailed analysis of such negative clauses in Czech may be found in
Daneš (1954).

14. Some remarks in Chomsky (1965) as well as a personal communication by the 
same author seem to inaugurate a possible attempt to incorporate the said facts into 
the new generative framework. As regards sentence intonation, an earlier study by 
Stockwell (1960) contains some valuable suggestions. 


References 


ADAMEC, Р. (1963), “К úloze sémantik: i | 
, , у ve slovosled: i 
01.4, pp. 2972300, edu’, Slavica Pragensia, 
ADAMEC, Р. (1966), Porjadok slov v sovremennom russkom jazyke, Prague. 
BENEŠ, E. (1962). "Die Verbstellung im Deutschen, von der Mitteilungsperspektive 

her betrachtet’, Philologica Pragensia, vol. 5, pp. 6-19. 
BoLINGER, D. (1952), ‘Linear modification’ PML 

> Я А, vol. 67, рр. 1117-44. 

BUNING, J. E. J., and Schooneveld, C. H. van (1961), The Sentence Intonation 
c of qu DIEN Russian as a Linguistic Structure, Mouton. 

HAO, Y. R. ,'H i d i A Li Х 

ИЕ Sc? iow Chinese logic operates’, Anthropol. Ling., 
Cuomsky, N. (1965), Aspects of the Theor: ji 

Ў у of Syntax, Н. ї 'ess. 

CosERIU, E. (1952), Sistema, norma y habla, MER TA x e 
Скузтат, D., and QUIRK, R. (1964), System t ingui gures 

Aeg ‘ems of Prosodic and Paralinguistic Fea 
Daneš, Е. (1954), ‘Příspěvek К rosboru vý у i 

Ё významové výst ý *, Studie @ 

prace Lingvistické, vol. 1, p. 215. Ry poy Sy 
SE F. (1957), Intonace а věta ve spisovné češtině, Prague. 

ANES, Е. (1959), “К otácze pořádku slov v Hi CR 5 

Slovesnosk, vol. 20, pp. 1-9. EC 


230 Intonation and Grammar 


DANEŠ, F. (1960), ‘Sentence intonation from a functional point of view’, Word,
vol. 16, pp. 34-54.
DANEŠ, F. (1964), ‘A three-level approach to syntax’, TLP, vol. 1, pp. 225-40.
DANEŠ, F. (1965), Proceedings of the Ninth International Congress of Linguists,
The Hague, p. 420.
EBELING, C. L. (1958), ‘Subject and predicate, especially in Russian’, Dutch
Contributions to the Fourth International Congress of Slavists.
ESSEN, O. VON (1956), Grundzüge der hochdeutschen Satzintonation, Düsseldorf.
FIRBAS, J. (1953), ‘On the communicative function of the verb in English, German
and Czech’, Brno Studies in English, vol. 1, pp. 39-68.
FIRBAS, J. (1961), ‘On the communicative value of the modern English finite verb’,
Brno Studies in English, vol. 3, pp. 79-104.
FIRBAS, J. (1962), ‘Notes on the function of the sentence in the act of
communication’, Sborník prací filosofické fakulty Brněnské University, p. A10.
FIRBAS, J. (1964a), ‘On defining the theme in functional sentence analysis’,
TLP, vol. 1, pp. 267-80.
FIRBAS, J. (1964b), ‘From comparative word-order studies. (Thoughts on
V. Mathesius’ conception of the word-order system in English compared with that
of Czech)’, Brno Studies in English, vol. 4, pp. 111-128.
FIRBAS, J. (1966), ‘Non-thematic subjects in contemporary English’, TLP, vol. 2,
pp. 239-56.
GREENBERG, J. H. (1963), ‘Some universals of grammar’, in Universals of
Language, Harvard University Press, pp. 58-90.
HALLIDAY, M. A. K., MCINTOSH, A., and STREVENS, P. (1964), The Linguistic
Sciences and Language Teaching, Longman.
HATCHER, A. G. (1956a), ‘Syntax and the sentence’, Word, vol. 12, pp. 234-50.
HATCHER, A. G. (1956b), ‘Theme and underlying question’, Supplement to Word,
vol. 12.
HAUSENBLAS, K. (1964), ‘On the characterization and classification of discourses’,
TLP, vol. 1, pp. 67-83.
HOCKETT, C. F. (1963), ‘The problem of universals in language’, in Universals
of Language, Harvard University Press.
JAKOBSON, R. (1956), Fundamentals of Language, Mouton.
JAKOBSON, R. (1963), ‘Implications of language universals for linguistics’, in
Universals of Language, Harvard University Press, pp. 208-19.
JESPERSEN, O. (1969), Analytic Syntax.
JONES, D. (1956), An Outline of English Phonetics, Dutton.
KLIMA, E. S. (1964), ‘Negation in English’, in The Structure of Language,
Prentice-Hall, pp. 246-323.
LAPTEVA, O. A. (1963), ‘Čexoslovackie raboty poslednix let po voprosam
aktual’nogo členenija predloženija’, Voprosy Jazykoznanija, no. 4, pp. 120-27.
LEE, W. R. (1960), An English Intonation Reader, Macmillan.
MATHESIUS, V. (1929), ‘Zur Satzperspektive im modernen Englisch’, Archiv für
das Studium der modernen Sprachen und Literaturen, vol. 84, no. 155, pp. 200-10.
NOVÁK, P. (1959), ‘O prostředcích aktuálního členění’, Acta Universitatis
Carolinae, Philologica, vol. 1, p. 10.
PALA, K. (1966), ‘O nekotoryx problemax aktual’nogo členenija’, Prague Studies in
Mathematical Linguistics, vol. 1.
SCHUBIGER, M. (1935), The Role of Intonation in Spoken English, Cambridge.
František Daneš 231 


STOCKWELL, R. P. (1960), ‘The place of intonation in a generative grammar of
English’, Language, vol. 36, pp. 360-67.
UHLÍŘOVÁ, L. (1966), ‘Some aspects of word order in categorial and
transformational grammars’, Prague Studies in Mathematical Linguistics, vol. 1,
pp. 159-66.
VACHEK, J. (1947), ‘Obecný zápor v angličtině a v češtině’, Prague Studies in
English, vol. 6, pp. 7-73.
WORTH, D. S. (1964), ‘Ob otobraženii linejnych otnošenij v poroždajuščich
modeljach jazyka’, Voprosy Jazykoznanija, no. 5, pp. 46-58.




Part Four 
Intonation and Emotion 


It is no news to the average speaker that he betrays emotions by the
tone of his voice. But how does he do it? Is it only the broad
sweep of intonation that conveys fear, happiness, boredom, secrecy
and doubt, or does the information lie partly in the squiggles or
perhaps in other changes that have nothing to do with fundamental
frequency? In the first article Lieberman and Michaels show that we
do indeed depend on many factors for a sure identification, but that
of those tested pitch contributes most.
The second Reading carries the analysis a step farther. It is 
concerned with how rather than how much. Uldall takes the presence 
of emotional meanings for granted and establishes connections between 
particular intonations and particular emotions. Her technique is that 
of the well-known semantic differential. She locates the intonation 
patterns in an ‘emotional space’ whose dimensions are the paired 
opposites of commonly sensed emotions that average speakers can
recognize and talk about: pleasant-unpleasant, strong-weak, and
authoritative-submissive.
The question of emotion is not a mere side issue where intonation
as a part of language is concerned, for it is next to impossible to
separate emotional meanings from grammatical ones. The first
example that comes to mind when we want to illustrate the grammatical
function of intonation is the rising pitch of yes-no questions. But is
this purely grammatical? Such questions have a falling intonation
about as often as a rising one, which proves that there is no true
interdependence. A yes-no question then must have its rise for some
other reason than the fact that it is a yes-no question. Is it because of
an attitude – more uncertainty, greater curiosity? If so, that is
suspiciously close to emotion.




12 Philip Lieberman and Sheldon B. Michaels 


Some Aspects of Fundamental Frequency and Envelope 
Amplitude as Related to the Emotional Content of Speech 


Philip Lieberman and Sheldon B. Michaels, ‘Some aspects of fundamental
frequency and envelope amplitude as related to the emotional content of speech’,
Journal of the Acoustical Society of America, vol. 34, no. 7, July 1962, pp. 922-7.


Authors' summary


Pitch pulses were electronically derived from the utterances of three male
native speakers of American English who each read eight neutral test
sentences in certain ‘emotional’ modes, i.e. as a question, an objective
statement, a fearful utterance, a happy utterance, etc. A fixed-vowel
POVO-type synthesizer was excited by these pitch pulses. The pitch
perturbations, or rapid variations in the fundamental excitation rate,
could be smoothed out and the POVO could be amplitude-modulated
with a signal derived from the original speech envelope amplitude.
Tapes were recorded and presented to separate groups of naive listeners
who categorized the emotional modes in forced judgement tests. Results
of the tests show that with unprocessed speech, the listeners were able to
correctly identify the emotional content 85 per cent of the time. When
only pitch information was presented, correct identification was made 44
per cent of the time. When amplitude information was added to the pitch
information, the identification rose to 47 per cent. Smoothing the pitch
information with a 40-ms time constant reduced the identifications to
38 per cent, while 100-ms smoothing reduced the identifications to 25
per cent. A 120 Hz monotone with amplitude information derived from
the original speech envelope amplitude resulted in 14 per cent identifications.


Introduction
The object of this experiment was to examine the contributions of fundamental
frequency and of amplitude to the transmission of the emotional
content of normal human speech. With respect to the fundamental frequency,
we particularly wished to see whether the perturbations of vocal
pitch, that is, the irregularities in the fundamental excitation rate, were
pertinent to the transmission of emotional information.

We used the techniques of speech synthesis in this experiment and synthesized
acoustic stimuli that differed from one another only with respect
to particular acoustic parameters that exist in normal speech. These
synthesized stimuli were then listened to and categorized by groups of
subjects in terms of a set of emotional categories. If the presence or absence
of a particular acoustic parameter causes a difference in the listeners'
judgements of the synthesized acoustic stimuli, and if the presence of this
particular parameter has been noted in the analysis of human speech, then
we can state, with reasonable certainty, that this acoustic parameter is
pertinent to the transmission of information and is an acoustic correlate
of some phonetic or emotional event.

We wished, in so far as possible, to avoid subjective judgements, such as 
occur when informants are simply asked to state whether a particular tape 
recording sounds more natural than some other tape recordings. We 
therefore used categorization procedures. 


Experimental procedure 
Emotional modes 


In an earlier study (Lieberman, 1961), a set of eight neutral sentences
was recorded in an anechoic chamber by six male native speakers of
American English. Each speaker was instructed to read each sentence with
appropriate vocal modifications so that it could be identified as belonging
to one of the following eight categories, or emotional modes: (1) a bored
statement, (2) a confidential communication, (3) a question expressing
disbelief or doubt, (4) a message expressing fear, (5) a message expressing
happiness, (6) an objective question, (7) an objective statement, and (8)
a pompous statement. Each sentence was read three times in each mode
(see Table 1).


Table 1 List of sentences used in the experiment 


1. The lamp stood on the desk. 

2. They have bought a new car. 

3. He will work hard next term.

4. His friend came home by train. 
5. They parked near the street light. 
6. We talked for a long time. 

7. John found him at the phone. 

8. You have seen my new house. 


The twenty-four repetitions of each sentence were then placed in
random sequence and categorized by a group of twenty linguistically naive
listeners in a forced judgement test to select the most identifiable utterance
from each of the eight emotional categories for each sentence. A panel of
trained observers then listened to the same set and rejected those utterances
that they found to be unnatural or strained. On the basis of these criteria
the best utterances of each of the eight emotional categories for each of the
eight sentences of each speaker were selected.

For the present experiment we took from this selected set the utterances
of three of the speakers, and again categorized the utterances in terms of
the eight emotional modes with a new group of ten naive listeners. We
then proceeded to isolate the frequency and amplitude information contained
in the utterances of this tape recording.


Preparation of test stimuli 

The first step was to make an intermediate tape for the generation of our
test stimuli. With pitch-extraction circuits we derived a marker pulse on
the leading edge of the amplitude peak of each fundamental period. We
recorded these pulses on a dual-track tape with the original speech on
the upper track and the derived pitch pulses on the lower track. The two
waveforms were then simultaneously displayed by a tape-scanning mechanism
on an oscilloscope. The errors in pitch extraction were then manually
corrected with special erase and recording circuits on the tape-scanning
device. Our derived pitch pulses were accurate to within 0.2 ms of the
fundamental excitation. We used this tape to generate all of our synthesized
waveforms. The derived pitch pulses were used to drive a POVO
fixed-vowel synthesizer having formant frequencies of 750, 1100 and 2450
Hz and bandwidths of 70, 80 and 115 Hz, respectively. The output of the
POVO was made asymmetrical with a diode and RC circuit, the better
to approximate human sounds.
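
For readers who wish to experiment with a comparable stimulus today, the fixed-vowel stage can be approximated digitally. The following is only a minimal sketch, not the original analogue POVO circuit: it assumes a 10 kHz sampling rate, a cascade of standard two-pole digital resonators tuned to the formant frequencies and bandwidths quoted above, and unit impulses as excitation. The function names are hypothetical, and the diode and RC asymmetry stage is omitted.

```python
import math

# Formant frequencies and bandwidths (Hz) quoted in the text.
FORMANTS = [(750.0, 70.0), (1100.0, 80.0), (2450.0, 115.0)]


def resonator_coeffs(freq_hz, bw_hz, fs):
    """Coefficients of a two-pole resonator with unity gain at 0 Hz."""
    t = 1.0 / fs
    c = -math.exp(-2.0 * math.pi * bw_hz * t)
    b = 2.0 * math.exp(-math.pi * bw_hz * t) * math.cos(2.0 * math.pi * freq_hz * t)
    a = 1.0 - b - c
    return a, b, c


def resonate(signal, freq_hz, bw_hz, fs):
    """Filter a list of samples: y[n] = a*x[n] + b*y[n-1] + c*y[n-2]."""
    a, b, c = resonator_coeffs(freq_hz, bw_hz, fs)
    y1 = y2 = 0.0
    out = []
    for x in signal:
        y = a * x + b * y1 + c * y2
        out.append(y)
        y2, y1 = y1, y
    return out


def fixed_vowel(pulse_times_s, duration_s, fs=10000):
    """Place one unit impulse per pitch pulse and pass it through the cascade."""
    n = int(round(duration_s * fs))
    excitation = [0.0] * n
    for t in pulse_times_s:
        i = int(round(t * fs))
        if 0 <= i < n:
            excitation[i] = 1.0
    wave = excitation
    for freq_hz, bw_hz in FORMANTS:
        wave = resonate(wave, freq_hz, bw_hz, fs)
    return wave
```

Because the vowel quality is fixed, any difference between two outputs of `fixed_vowel` comes only from the timing of the pulses, which is the property the experiment relies on.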

We subsequently made five different tape recordings, each using the
available acoustic parameters in a specific way. The first tape recording of
synthesized speech made under these conditions used the derived pitch
pulses of constant amplitude to drive the POVO. This essentially isolated
all of the fundamental frequency information of our real speech input,
removing all phonetic and amplitude information. The original speech
waveform was recorded on the lower track of this tape recording and was
critically compared with the synthesized waveform on the upper track in
this and in subsequent tape recordings.

On our second tape recording we included amplitude as well as fundamental
frequency information. A 20 ms full-wave rectifying circuit was
used to obtain the envelope amplitude of the original speech from the
waveform on the upper track of our generating tape. The derived pitch
pulses on the lower track of the generating tape which excited the POVO
were then amplitude-modulated by this signal.
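
The envelope extraction and amplitude modulation just described can be illustrated with a short sketch. It assumes that the "20 ms" figure refers to the smoothing time constant that follows the full-wave rectifier, and it models that smoothing as a first-order (RC-like) low-pass filter; both are assumptions, and the function names are hypothetical.

```python
import math


def envelope(speech, fs, tau_s=0.020):
    """Full-wave rectify the samples, then smooth with a one-pole low-pass."""
    alpha = 1.0 - math.exp(-1.0 / (tau_s * fs))
    state = 0.0
    env = []
    for x in speech:
        state += alpha * (abs(x) - state)  # abs() is the full-wave rectifier
        env.append(state)
    return env


def modulate_pulses(pulse_indices, env):
    """Give each pitch pulse the envelope amplitude at its sample index."""
    return [(i, env[i]) for i in pulse_indices if 0 <= i < len(env)]
```

The pairs returned by `modulate_pulses` are (sample index, pulse amplitude); a synthesizer would scale each excitation pulse by the second value.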
In our next tape recording we retained the amplitude modulation of the
POVO. However, we removed the pitch perturbations by converting the
derived pitch pulses on the generating tape to an analogue signal, smoothing
the analogue signal, and then converting the smoothed analogue
signal into a pulse stream again. The block diagram for this operation is
presented in Figure 1. The overall linearity of the frequency transformation
is presented in Figure 2.

[Figure 1 Block diagram of the pitch-smoothing operation. Blocks: pulses
derived from speech; integrating circuit (analogue voltage output inversely
proportional to pitch period); voltage-controlled output-pulse generator; gating
and amplitude modulation of the output pulses by the speech envelope during
the voiced intervals of the original speech.]


We produced two tapes in which we excited the POVO with smoothed
pitch, retaining full amplitude modulation. The smoothing time constants
were 40 and 100 ms, respectively. The output-pulse generator, which
excited the POVO, always returned to 100 Hz in the absence of voicing.
The system is very similar to that used in some Vocoder pitch-synthesis
circuits.
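
The smoothing chain (pulses to analogue voltage, smoothing, voltage back to pulses) can be sketched in discrete form. This is an illustration under stated assumptions rather than a model of the authors' circuit: each pitch period is converted to a frequency, the frequency track is passed through a first-order low-pass filter whose time constant plays the role of the 40 ms or 100 ms constant, and new pulses are spaced by the reciprocal of the smoothed frequency. Gating and the return to 100 Hz in unvoiced stretches are not modelled, and all names are hypothetical.

```python
import math


def period_frequencies(pulse_times_s):
    """Frequency (Hz) of each pitch period: the inverse of its duration."""
    return [1.0 / (b - a) for a, b in zip(pulse_times_s, pulse_times_s[1:])]


def smooth_frequencies(pulse_times_s, tau_s):
    """First-order low-pass smoothing of the period-by-period frequency."""
    freqs = period_frequencies(pulse_times_s)
    state = freqs[0]
    smoothed = []
    for f in freqs:
        dt = 1.0 / f  # time elapsed during this period
        alpha = 1.0 - math.exp(-dt / tau_s)
        state += alpha * (f - state)
        smoothed.append(state)
    return smoothed


def regenerate_pulses(start_s, smoothed_freqs):
    """Turn the smoothed frequency track back into pulse times."""
    times = [start_s]
    for f in smoothed_freqs:
        times.append(times[-1] + 1.0 / f)
    return times
```

With alternating 9 ms and 11 ms periods as input, the period-to-period swing of the smoothed track shrinks as `tau_s` grows from 0.040 to 0.100, which is the sense in which the longer time constant removes more of the fine structure.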

A final tape was made in which we set the output pulse generator to a
constant 120 Hz and amplitude-modulated it, again deriving the amplitude
information from the original speech on the upper track of our generating
tape. This tape essentially isolated amplitude information from the
pitch and phonetic information contained in the original speech. In the
three latter tapes the output pulse generator was gated so that only the
voiced parts of the utterances were synthesized.

[Figure 2 Calibration of pitch smoothing. The ordinate is the input-pulse
frequency (Hz) for the system of Figure 1, while the abscissa is the output-pulse
frequency (Hz) after smoothing. Both axes run from 100 to 300 Hz.]


Listening tests
Listening tests were conducted for each of these tapes with separate
groups of ten naive listeners each. Each tape was presented to a group
which had not heard any of the tapes before. By restricting our identifications
to those obtained during the listener's first exposure to the stimuli,
we minimized the chance that our listeners might be learning to identify
the utterances by means of the idiosyncrasies of the speakers from
whose utterances the pitch and amplitude parameters were derived. In a
larger group of speakers these idiosyncrasies might well be ignored by the
listeners who would ascribe them to the normal free variation that is
encountered in more usual listening situations. In other words, we tried
to get the listeners to react to the cues that they would use in listening to
normal English.


Observations 
General effects 


[Figure 3 Percentages of correct identification of the emotional modes for
each tape: (1) original speech; (2) perfect pitch, no amplitude modulation;
(3) perfect pitch, amplitude modulation; (4) 40 ms smoothed pitch, amplitude
modulation; (5) 100 ms smoothed pitch, amplitude modulation; (6) constant
pitch, amplitude modulation. Horizontal axis: correct responses (%), 0 to 100.]

In Figure 3 we have tabulated the percentages with which the utterances
were correctly categorized with respect to the emotional modes for each
of our six tape recordings. Each run represents the responses of a different
group of naive listeners on their first exposure to the test material. We
note several effects. When the unprocessed speech was presented, 85 per
cent of the modes were correctly identified. When we presented only pitch
information, only 44 per cent were correctly identified. When amplitude
information was added to the pitch information, the identification rose to
47 per cent. Smoothing the pitch information with a 40 ms time constant
reduced the identifications to 38 per cent, 100 ms smoothing reduced the
identifications to 25 per cent, while a monotone with amplitude modulation
resulted in only 14 per cent identification, which, however, is still
significant at the 0.006 level with respect to the results that would occur
through chance guesses.
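
The comparison with chance guessing can be made concrete. With eight emotional modes, a listener guessing at random would be correct one time in eight, or 12.5 per cent. The sketch below computes that chance level and an exact one-sided binomial tail probability; it is an illustration only, since the text does not state how many judgements lay behind the 14 per cent figure or which test the authors used. Any values passed for `k_correct` and `n_trials` are therefore hypothetical.

```python
from math import comb


def chance_level(n_categories):
    """Probability of a correct answer when guessing among equal categories."""
    return 1.0 / n_categories


def binomial_tail(k_correct, n_trials, p):
    """P(X >= k_correct) for X ~ Binomial(n_trials, p)."""
    return sum(
        comb(n_trials, k) * p**k * (1.0 - p) ** (n_trials - k)
        for k in range(k_correct, n_trials + 1)
    )
```

A small tail probability for the observed number of correct responses would indicate that the listeners were doing better than guessing, even when the percentage itself is only slightly above 12.5.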

The following aspects of the data should be noted. First, fundamental
frequency alone is not able to transmit full emotional information. Second,
amplitude information plays a small though significant part in the correct
recognition of emotions. Third, noting the results of tapes three, four and
five, and recalling that all three had the same gross pitch range and the
same amplitude modulation, we see that the fine structure of the fundamental
excitation rate has a perceptible effect on the identification of the
emotional modes (Figure 4).

[Figure 4 Effect of pitch smoothing on the identification of the modes. Tapes
3, 4 and 5 of Figure 3 are plotted. The stimuli on all three tapes had the same
gross pitch range and the same amplitude modulation; only the fine structure
of the pitch was varied. Axes: smoothing constant (ms) against correct
responses (%); the level for original speech (tape 1) is marked for comparison.]


Effects of emotional modes 


In Figure 5 we have ordered the different emotional modes according to 
the frequency with which they were correctly identified for each of our 
processed tape recordings. 


Tape            (1)        (2)         (3)         (5)          (6)
                Original   Perfect     Perfect     100 ms       Amp. mod.,
                speech     pitch,      pitch,      smoothed     constant
                           no amp.     amp. mod.   pitch,       pitch
                           mod.                    amp. mod.

HIGHEST         1          6           6           6            7
IDENTIFICATION  6          1           1           1, 3         4
                7, 8       7           3           7            1
                3, 5       3           7           5            5
                2          4           8           4            3, 6, 8
                4          8           4           2            2
LOWEST                     5           5           8
IDENTIFICATION             2           2

1) is BOREDOM         3) is DISBELIEF    5) is HAPPINESS    7) is STATEMENT
2) is CONFIDENTIAL    4) is FEAR         6) is QUESTION     8) is POMPOUS

Figure 5 Relative identifiability of the emotional modes for each processing
condition.


Effects of individual speakers 


Individual speakers also seem to favor the use of different acoustic parameters
for the transmission of the same emotional information. An example
of this is seen in Figures 6 and 7.

Speaker 1
                       Response
Stimulus     1    2    3    4    5    6    7    8
    1      671  063  172  000  016  016  203  109
    2      188  311  094  016  047  063  422  109
    3      000  078  687  031  125  250  063  016
    4      078  016  047  796  203  031  016  063
    5      016  016  078  656  234  063  031  156
    6      031  047  109  000  000  828  172  063
    7      234  172  063  016  000  016  702  047
    8      047  125  234  016  109  047  156  516

Speaker 2
                       Response
Stimulus     1    2    3    4    5    6    7    8
    [rows 1 to 6 illegible]
    7      266  141  047  000  000  031  749  016
    8      016  219  125  031  031  000  078  750

Speaker 4
                       Response
Stimulus     1    2    3    4    5    6    7    8
    1      812  141  016  016  031  000  203  031
    2      203  266  031  031  078  016  484  141
    3      000  000  891  031  047  281  000  000
    4      016  016  156  515  219  281  000  047
    5      016  031  156  422  406  063  031  125
    6      016  125  344  000  000  718  047  000
    7      172  297  047  047  016  016  546  109
    8      109  219  016  031  063  000  203  609

Figure 6 Normalized stimulus-response matrix for each speaker for tape 3
(in % x 10⁻²) (cf. Figure 3). Note that speaker two's mode-5 stimuli were
less often confused with mode 4 than were the mode-5 and 4 stimuli of
speakers one and four.


Speaker 1
                       Response
Stimulus     1    2    3    4    5    6    7    8
    1      280  188  344  047  063  000  250  078
    2      266  171  156  078  016  172  328  063
    3      078  188  344  078  109  266  156  031
    4      047  016  250  202  266  172  078  219
    5      016  359  156  344  172  047  109   [one entry illegible]
    6      063  156  156  047  031  578  219  000
    7      563  109  078  047  047  047  281  078
    8      141  094  313  016  375  063  094  154

Speaker 2
    [matrix illegible]

Speaker 4
                       Response
Stimulus     1    2    3    4    5    6    7    8
    1      345  156  109  000  078  125  328  109
    2      156  219  156  109  063  047  328  172
    3      000  047  405  094  250  313  094  047
    4      016  016  344  170  297  313  031  063
    5      016  063  250  172  453  109  078  109
    6      078  172  219  031  031  547  141  031
    7      359  297  016  031  078  031  329  109
    8      297  297  094  063  078  031  313  077

Figure 7 Normalized stimulus-response matrix for each speaker for tape 5
(in % x 10⁻²) (cf. Figure 3). Note that when pitch was smoothed, speaker two's
mode-5 stimuli were more often confused with mode 4 than were the mode-5
and 4 stimuli of speakers one and four.
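
A stimulus-response matrix of this kind can be tallied from raw judgements as in the sketch below. The sketch normalizes each stimulus row so that it sums to 1; this is an assumption made for illustration and may not match the normalization used for Figures 6 and 7. The function names and any judgement data supplied to them are hypothetical.

```python
def stimulus_response_matrix(judgements, n_modes=8):
    """Count responses; judgements are (stimulus_mode, response_mode) pairs, 1-based."""
    counts = [[0] * n_modes for _ in range(n_modes)]
    for stimulus, response in judgements:
        counts[stimulus - 1][response - 1] += 1
    return counts


def row_proportions(counts):
    """Divide each stimulus row by its total (rows with no data stay at zero)."""
    table = []
    for row in counts:
        total = sum(row)
        table.append([c / total if total else 0.0 for c in row])
    return table


def confusion(table, stimulus, response):
    """Proportion of `response` answers given to `stimulus` items (1-based modes)."""
    return table[stimulus - 1][response - 1]
```

For the comparison discussed in the text, the quantity of interest is `confusion(table, 5, 4)`, the share of mode-5 (happiness) stimuli that listeners labelled as mode 4 (fear), computed separately for each speaker and tape.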


[Figure 8 Distribution of the absolute value of the difference between the
durations of adjacent pitch periods² in ms, $\Delta\tau = |t_n - t_{n-1}|$, for the
mode-5 stimuli of each speaker (speakers 1, 2 and 4). The difference in ms is
plotted on the abscissa while the normalized frequency of occurrence is plotted
on the ordinate. Note that speaker two's distribution favors larger differences
between the durations of his adjacent pitch periods; he thus tends to have
greater pitch perturbations for his mode-5 stimuli than speakers one or four.]

2. A pitch period is one cycle of glottal excitation (‘vibration of the vocal cords’).
If the fundamental frequency is expressed in hertz (Hz or cycles per second), a
fundamental of 100 Hz, for example, would correspond to a pitch period of 1/100 s
or 10 ms. The pitch period is thus the inverse of the fundamental at any instant.

[Figure 9 Distribution of the absolute value of the difference between the
durations of adjacent pitch periods in ms for the mode-4 stimuli of each speaker
(speakers 1, 2 and 4); axes as in Figure 8. Note that the mode-4 stimuli of
speaker two are not differentiated from those of speakers one or four by a
different distribution as are his mode-5 stimuli. Thus when the pitch
perturbations were smoothed out more confusions between mode 5 and 4
occurred for speaker two (cf. Figures 6 and 7).]

In Figure 6 the complete stimulus-response matrix has been tabulated
for each speaker for the tape having perfect pitch with amplitude modulation.
We note that for speakers one and four the listeners tended to make
mode-4 responses to mode-5 stimuli. They did not confuse speaker two's
stimuli in this manner. However, when we look at Figure 7 in which the
matrices have again been tabulated for the tape having 100 ms smoothed
pitch, the situation is completely reversed and the listeners make mode-4
responses to speaker two's mode-5 stimuli. We can see a possible explanation
for this behavior in Figures 8 and 9, which present some of the results
of the preliminary analysis of the pitch perturbations. In Figure 8 we can
see that speaker two's distribution of the differences between adjacent
periods favors greater pitch perturbations for his mode-5 stimuli than do
either speaker one's or four's distributions. When the same plot is made
in Figure 9 for mode-4 stimuli, the distributions of all three speakers are
quite similar. Apparently speaker two's mode-5 stimuli are differentiated
from his mode-4 stimuli through the presence of these perturbations.
When we smooth out the fine structure of the pitch variations we remove
the distinction between modes 5 and 4 for speaker two, causing the listeners
to confuse the two modes.
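
The perturbation measure plotted in Figures 8 and 9, the absolute difference between the durations of adjacent pitch periods, is simple to compute from a list of pitch-pulse times. The sketch below is an illustration with hypothetical function names; the bin width and number of bins are assumptions and are not taken from the figures.

```python
def period_ms(f0_hz):
    """Pitch period in ms for a fundamental in Hz (the period is the inverse)."""
    return 1000.0 / f0_hz


def period_differences_ms(pulse_times_s):
    """Absolute difference, in ms, between each pair of adjacent pitch periods."""
    periods = [(b - a) * 1000.0 for a, b in zip(pulse_times_s, pulse_times_s[1:])]
    return [abs(q - p) for p, q in zip(periods, periods[1:])]


def normalized_histogram(values_ms, bin_width_ms=0.1, n_bins=7):
    """Share of values per bin; values beyond the last edge fall in the last bin."""
    counts = [0] * n_bins
    for v in values_ms:
        counts[min(int(v / bin_width_ms), n_bins - 1)] += 1
    total = len(values_ms)
    return [c / total for c in counts] if total else [0.0] * n_bins
```

A speaker whose histogram puts more weight in the higher bins has larger period-to-period perturbations, which is the pattern the text reports for speaker two's mode-5 stimuli.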


Stability of emotional categorizations
The results of the categorizations of the emotional modes by naive listeners
on their first exposure to the stimuli were rather stable. As a check we ran
the perfect pitch with no amplitude-modulation tape (cf. Figure 3) with two
different groups of listeners. Both groups correctly identified the utterances
43.5 per cent of the time.


Discussion

Relevance of experimental results to less restricted speech signals
The level of correct identifications of the emotional modes in the data fell
from 85 per cent to 47 per cent when we removed all phonetic information,
retaining only pitch and envelope amplitude information. (By phonetic
information we mean here any information directly related to vocal tract
configurations or the spectral composition of the glottal excitation function,
i.e. information that would be transmitted in the spectrum channels of
a Vocoder or displayed in a narrow-band spectrogram. Roughly this is the
information conveyed, in writing, by letters.) Since phonetic events play
so important a role in the transmission of the emotional modes
we might wonder whether pitch plays a secondary role in the presence
of phonetic information and whether pitch information is immaterial or
negligible in the presence of a correct and complete phonetic description
of the speech material. Certain recent experiments suggest that this is not
the case and that in the presence of phonetic information pitch still plays
an important role in the transmission of emotional information and
the enhancement of natural quality.

Abramson (1959) in an experiment with Vocoder-processed speech
with a tone language (Thai), passed normally spoken Thai words, minimally
distinguished by tone, ‘... through the Vocoder with the fundamental
frequency of the buzz kept constant. No discriminations were made! With
hiss alone, however, the results were better than in the natural whisper. We
conclude that the features were present in the normal speech but that in
the presence of the buzz listeners were set to hear pitch variations. Inspection
of spectrograms suggests that tonal oppositions in Thai whispering
lean on such concomitant features as changes in intensity, relative durations
of vowels and small variations in formant frequencies’. Denes (1959)
reported similar results for English intonation.

Kersten, Bricker and David (1960) reported on an experiment in which 
they used digital-computer processing to remove the pitch perturbations 
of sustained vowels spoken by two talkers. In A-B pairs, these processed
samples were presented, together with the originals, to naive listeners who 
readily identified the machine-processed samples, which sounded noticeably 
mechanical compared to the originals. 

It is therefore reasonable to infer that the results of our experiments, in 
which all phonetic information was removed, are relevant to more general 
speech transmission problems in which phonetic information is of course 
present. 


Conclusions 


1. There is no one single acoustic correlate of the emotional modes of this 
experiment. Phonetic content, gross changes in fundamental frequency, 
the fine structure of the fundamental frequency, and the speech envelope 
amplitude, in that order, all contributed to the transmission of the emo- 


tional modes. Durational cues were not isolated in any stage of the experi- 
ment and therefore cannot be discussed. 


2. The different emotional modes did not all depend to the same degree on
all the acoustic parameters. Different speakers also favored different
acoustic parameters for the transmission of the same emotional mode.


3. The fine structure of the fundamental frequency, that is, the perturbations
in fundamental frequency, appears to be an acoustic correlate of the
emotional modes. When these perturbations were smoothed out confusions
between the emotional modes increased.

Most current systems of linguistic analysis of intonation seem incomplete
in that they merely note gross changes in fundamental frequency,
minimize the role of amplitude and phonetic variations, and entirely ignore
the fine structure of the fundamental frequency. We have seen in this
experiment, however, that these additional dimensions are responsible for
a large fraction of the total emotional information transmitted in human
speech.


References 
ABRAMSON, A. S. (1959), ‘Vocoder output and whispered speech in a tone
language’, J. Acoust. Soc. Amer., vol. 31, p. 1568.
DENES, P. (1959), ‘Preliminary investigation of certain aspects of intonation’,
J. Acoust. Soc. Amer., vol. 31, p. 852.
KERSTEN, L. G., BRICKER, P. D., and DAVID, E. E. Jr (1960), ‘Human or
machine: a study of voice naturalness’, J. Acoust. Soc. Amer., vol. 32, p. 1502.
LIEBERMAN, P. (1961), ‘Perturbations in vocal pitch’, J. Acoust. Soc. Amer.,
vol. 33, p. 597.


13 Elizabeth Uldall 


Dimensions of Meaning in Intonation 


Elizabeth Uldall, ‘Dimensions of meaning in intonation’, from In Honour of
Daniel Jones: Papers Contributed on the Occasion of his Eightieth Birthday, 12
September 1961, edited by David Abercrombie, D. B. Fry, P. A. D. MacCarthy,
N. C. Scott and J. L. M. Trim, Longman, 1964, pp. 271-9.


[Editor’s note: The first two paragraphs in the Reading as it appears here are 
the first two paragraphs of the Reading referred to as Uldall, 1960.] 


It is clear that some kind of meaning is conveyed by the intonation of 
connected speech in both tone languages and non-tone languages. There 
is little agreement about the terms in which this meaning is to be described; 
every writer on the subject employs an open-ended supply of terms for 
this purpose. One kind of meaning conveyed is, however, clearly social 
and emotional rather than referential. Intonation can express social
attitudes: speaker to listener: ‘It wasn’t what she said, it was the way she
said it!’; to subject matter: ‘Well, don’t get in a temper with me; I’m not
the Income Tax collector’; to the world in general: ‘He sounds so arrogant’,
‘Don’t whine!’

Attitude measurement seemed a promising technique by which to 
attempt to find out whether a group of subjects from the same linguistic 
community would in fact agree on the ‘meanings’ of intonations, and 


whether some few very general ‘dimensions of meaning’ in the emotional 
area could be extracted, 


The experiment described here! consisted in offering to a group of subjects 
a number of sentences on each of which sixteen intonation contours had 
been imposed synthetically, and asking them to rate these on a set of scales 
consisting of opposed adjectives, with a view to investigating the attitudes 
or emotional meanings conveyed by the various contours (see Osgood, 
Suci and Tannenbaum, 1957). Professor Osgood’s ‘semantic differential’ 
appears to be a suitable technique for investigating the emotional mean-
ings of intonation contours, since it is precisely ‘emotional meaning’ 


1. The work described here was carried out with the benefit of research funds from
the Haskins Laboratories, New York, in connection with a grant from the Carnegie
Corporation of New York.

I am much indebted to Dr Boris Semeonoff of the Department of Psychology of
Edinburgh University for statistical advice.


which is strongly present in intonation, and with this aspect of meaning 
the semantic differential deals most successfully. 

The fifteen subjects were speakers of American English, and the original 
recording on which the contours were synthesized was spoken by an 
American. The contours, shown in Figure 1, were the same as those used 
in an earlier experiment (Uldall, 1960). They were intended to cover all the 
kinds of variation which differentiate intonation contours, though of course 
nothing like all the possible combinations of variables were represented.

The variables are:

Range: wide/narrow.
Pitch reached at end of contour: high/mid/low.
Shape of contour: one direction/with a change of direction.
Treatment of weak syllables:
(a) Continuing the line of the strong syllables.
(b) Rising above the line of the strong syllables.
(c) Falling below the line of the strong syllables.

The ‘scales’ used were the same as in the earlier experiment, with the
addition of authoritative/submissive, unpleasant/pleasant, genuine/pretended
(feeling) and weak/strong (feeling), so that the page on which the subjects
were asked to rate each contour on each sentence appeared thus:

bored — — — interested
polite — — — rude
timid — — — confident
sincere — — — insincere
tense — — — relaxed
disapproving — — — approving
deferential — — — arrogant
impatient — — — patient
emphatic — — — unemphatic
agreeable — — — disagreeable
authoritative — — — submissive
unpleasant — — — pleasant
genuine — — — pretended
weak — — — strong

These terms were arranged with seven places between them; the subjects
were instructed that the places next to the terms should be checked to
indicate ‘extremely’ (bored or interested, etc.), the next place a little
farther in from the terms to indicate ‘quite’ (bored or interested, etc.), the
two places flanking the middle to indicate ‘slightly’ (bored or interested,
etc.), and the middle space to indicate ‘neutral’ or ‘neither’ in relation to
the scale under consideration.
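
The seven-place scheme just described lends itself to a simple numerical coding. The sketch below is offered only as an illustration: the numbering of the places from 1 to 7 and the function name `describe` are assumptions introduced here, not part of the procedure reported in the text.

```python
# Hypothetical coding of one seven-place scale, e.g. bored ... interested.
# Places are numbered 1 (next to the left-hand term) to 7 (next to the
# right-hand term); this numbering is an assumption made for illustration.
INTENSITY = {1: "slightly", 2: "quite", 3: "extremely"}


def describe(place, left, right):
    """Return the verbal label for a checked place on a left/right scale."""
    if not 1 <= place <= 7:
        raise ValueError("place must be between 1 and 7")
    distance = abs(place - 4)  # distance from the middle place
    if distance == 0:
        return "neutral"
    term = left if place < 4 else right
    return f"{INTENSITY[distance]} {term}"


print(describe(1, "bored", "interested"))
print(describe(5, "bored", "interested"))
```

On this coding one ‘scale unit’ is simply the difference between two adjacent places.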

The extra scales mentioned were added to the original ten in an effort
to find more ‘central’ terms for the dimensions of emotional meaning
which appear to be most strongly represented in intonation: ‘pleasant/
unpleasant’, ‘authoritative/submissive’ and ‘strong/weak’ (feeling ex-
pressed). In the material on which the diagrams of A, B, C, D, E are based,
three of the new scales were in fact used as being suitable characteristic
terms in these dimensions. The addition ‘genuine (feeling)/pretended
(feeling)’ was not successful; ‘pretended’ feeling must either not be
expressed in intonation, or be a function of intonation in context; all the
contours presented were rated as expressing ‘genuine’ feeling.


The four sentences were as follows: 
A. Statement: ‘None of the members are going.’
B. Yes-or-no question: ‘Was it arranged at the meeting?’
C. Question-word question: ‘What did he think they were doing?’ 
D. Command: ‘Bring it along to the meeting.’ 


To these was added a nonsense-sequence: 
E. ['soomoya 'ра!бәгә zeng) 


All of these sentences consisted of the same number and arrangement
of strong and weak syllables. The real sentences were intended to be
suitable as remarks between social equals. They were recorded by Dr
Alvin Liberman of the Haskins Laboratories, New York. He found it
necessary to speak them all on a rising contour in order that the final
syllables should not be so low in intensity as to make synthesis difficult.
The contours were ‘applied’ to the sentences by means of the Voback
Synthesizer.

Each subject took each test twice, some in the same order both times,
and some in the reverse order. This was done partly in order to increase
the number of judgements to be averaged, and partly to see whether the
judgements appeared to be affected by the order in which the contours
were presented: were the subjects judging the contours in relation to their
whole experience of intonation, or in relation to the preceding contours?
It is clear that the former is the case; the variability in judgement was more
or less constant for each subject as a person, from two-thirds of a scale
unit for the ‘best’ subject, to one and two-thirds scale units for the ‘worst’
one (‘scale unit’: the difference between e.g. ‘slightly bored’ and ‘quite
bored’). The average variation over all the texts was 1.12 scale units on
test/retest.

These are larger ‘errors’ than Osgood found on test/retest for subjects
judging word ‘concepts’ in similar tests: ‘... average errors ... always
less than a single scale unit ... and for evaluative [pleasant/unpleasant]
scales average about half of a scale unit’ (Osgood, Suci and Tannenbaum,
1957, p. 131).

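
A test/retest figure of this kind can be obtained in more than one way, and the text does not say which was used. The following sketch assumes, purely for illustration, that variability is the mean absolute difference between a subject's first and second ratings of the same items; the function name and the ratings are invented.

```python
def retest_variation(first, second):
    """Mean absolute difference, in scale units, between two sittings.

    Assumption: each list holds one subject's ratings (places 1-7) of the
    same items in the same order; the averaging rule is illustrative only.
    """
    if len(first) != len(second) or not first:
        raise ValueError("need two non-empty rating lists of equal length")
    return sum(abs(a - b) for a, b in zip(first, second)) / len(first)


# Invented ratings for one subject; not data from the experiment.
first_sitting = [2, 5, 4, 7, 3, 6]
second_sitting = [3, 5, 2, 6, 3, 7]
print(retest_variation(first_sitting, second_sitting))
```

A subject who never moved a judgement by more than one place would on this measure score below one scale unit.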
The contours themselves also varied in the amount of test/retest varia- 
bility in the judgements made of them: e.g. contour no. 15 (raised weak 
syllables, final rise) is usually near the most variable end of a list of the 
contours arranged to show this characteristic. ‘Raised weak syllables’, 
though they certainly occur in American intonation, in the speech of men 
as well as women, are sometimes said to be a ‘woman’s intonation’. The 
variability of the judgements in this case may indicate that this contour
was less familiar to the subjects than the other contours were, or that it
was unfamiliar to some of the subjects.

There were also differences in the amount of variability on the various 
scale terms: the subjects were least variable on the scales expressing the
‘pleasant/unpleasant’ dimension, and most variable on those expressing
the ‘strong/weak’ one. In other words, they were more consistent about
their own reactions to the contours than about what they judged the
‘speaker’s’ intention to be.

Factor analyses of the correlations between the various scales for each 
part of the experiment – A, B, C, D, E — were carried out with a view to 
extracting the main ‘dimensions of meaning’ conveyed by these intona-
tions. As in the previous experiment, the ‘pleasant/unpleasant’ factor was
by far the strongest. The grouping of the scale terms in the factor analyses
made it clear, however, that this time the ‘authoritative/submissive’
factor came second and the ‘strong (feeling)/weak (feeling)’ factor third.
This was the reverse of the earlier experiment, though there were some
indications in the earlier one that the arrangement might not be the same
for all four sentence types. It is possible that the uniform emergence of
the factors on all the sentences this time is related to a larger and better
choice of scale terms. 

Two scales were chosen to represent each factor in the construction of
the ‘semantic space’ diagrams (Osgood, Suci and Tannenbaum, 1957, p.
114), Figures 2a, b, c, d, e, which show the relations of the various contours
to the three ‘dimensions’ and to each other. Subjects’ scores (reversed where
necessary) on bored/interested and unpleasant/pleasant were averaged to
represent the ‘pleasant/unpleasant’ dimension; authoritative/submissive
and timid/confident were averaged for ‘authoritative/submissive’;
weak/strong and emphatic/unemphatic were averaged for the ‘strong/weak’
dimension.
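
The averaging of two scales per dimension, with reversal where necessary, may be sketched as follows. The coding of places 1 to 7 from left to right, the reversal rule `8 - x`, and all names and ratings below are assumptions made for illustration; the text specifies only which scales were paired.

```python
# Each dimension is represented by two scales. The flag says whether the
# scale is reversed so that a high score means the 'positive' pole
# (pleasant, authoritative, strong). Coding places 1-7 from left to right
# is an assumption made for this illustration.
DIMENSIONS = {
    "pleasant": [("bored/interested", False), ("unpleasant/pleasant", False)],
    "authoritative": [("authoritative/submissive", True),
                      ("timid/confident", False)],
    "strong": [("weak/strong", False), ("emphatic/unemphatic", True)],
}


def dimension_scores(ratings):
    """Average the two scales of each dimension, reversing where necessary."""
    scores = {}
    for dimension, scales in DIMENSIONS.items():
        values = [8 - ratings[name] if reverse else ratings[name]
                  for name, reverse in scales]
        scores[dimension] = sum(values) / len(values)
    return scores


# Invented mean ratings for one contour; not data from the experiment.
example = {
    "bored/interested": 6, "unpleasant/pleasant": 5,
    "authoritative/submissive": 2, "timid/confident": 6,
    "weak/strong": 5, "emphatic/unemphatic": 3,
}
print(dimension_scores(example))
```

The three resulting numbers are the coordinates by which a contour could be placed in a three-dimensional ‘semantic space’.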


The diagrams display, for each sentence-type and the nonsense-sequence,
three dimensions: the right half of the diagram shows contours judged
‘pleasant’, the left half ‘unpleasant’. The near half shows those judged
‘authoritative’, the far half ‘submissive’. Solid lines rising from the point
of intersection of these two scores show ‘strong’ judgements, the height of
the line being proportional to the ‘strength’ of the feeling; dotted lines
descending from the point of intersection of the first two scores show
‘weak’ judgements, with the length of the line showing how ‘weak’ the
contour was judged to be. The contours are shown at the ends of the
vertical lines.

Figure 2a  A: ‘None of the members are going.’
Figure 2b  B: ‘Was it arranged at the meeting?’
Figure 2c  C: ‘What did he think they were doing?’
Figure 2d  D: ‘Bring it along to the meeting.’
Figure 2e  E: Nonsense sequence
[Axes in each diagram: ‘unpleasant’/‘pleasant’, ‘authoritative’/‘submissive’,
‘strong’/‘weak’ (feeling).]

Contours may thus be described by three terms: no. 8 is in all cases
‘pleasant, authoritative, strong’; no. 4 is ‘unpleasant, authoritative, weak’
on the statement and both types of question, ‘unpleasant, authoritative,
strong’ on the command. Where contours bear the same description in
these terms, as e.g. nos. 8, 9 and 10, they may be near-synonyms in intona-
tion, or it may be that they would be differentiated on some dimension of
meaning not investigated.

The contours fall about equally into the ‘pleasant’ and ‘unpleasant’
sectors. Few contours appear in the ‘submissive’ sector; this may mean
that there are few ‘submissive’ intonations, or it may be that ‘submissive-
ness’ is expressed less readily by intonation than by tempo or voice-
quality, variations in which were of course expressly excluded from this
experiment. The effects of context are also excluded, which may bear on
the fact that fewer contours are considered ‘weak’ than ‘strong’.

Figure 2c for the question-word question differs markedly from the
others. Within the material of the experiment, it does not appear to be
possible to be ‘submissive’ in asking this type of question, and it is
difficult to convey ‘weak’ feeling. This is perhaps not a phonetic observa-
tion, but it is certainly of some linguistic interest.

The contours which are most nearly ‘neutral’ on the various sentence-
types are as follows:

Statement: Final rises ending at mid pitch.
Yes-or-no question: Final rises ending high.
Question-word question: Final rises, ending high or mid.
Command: Final rises, ending high or mid.



The ‘neutral’ contours for the yes-or-no question can perhaps be related 
to the American English contours usually described as typical for questions 
of this kind. The others are difficult to relate to any norm.

Generalizing over the five tests, the three ‘dimensions of meaning’
postulated here are associated with the elements in contour variation in 
the following ways: (terms in parentheses show less consistent connection 
with the dimension) 


‘pleasant’          rises ending high
                    change of direction [excluding no. 7]
‘unpleasant’        raised weak syllables
                    (lowered weak syllables)
                    (narrow range)
‘authoritative’     wide range
                    change of direction
                    rises ending at mid
                    raised or lowered weak syllables
                    (final fall)
‘submissive’        (rises ending high [excluding no. 8])
‘strong’ feeling    wide range
                    change of direction
                    lowered weak syllables
                    (rises ending at mid)
‘weak’ feeling      (narrow range)
                    (raised weak syllables)


The ‘positive’ ends of the dimensions are more easily characterized than 
the ‘negative’ ones. 


Where the contours are rated differently on the different sentences, it 
can be seen that the less ‘lively’ a contour is, the more variable it is in 
meaning. The narrow-range ‘smooth’ contours nos. 1, 3, 4 and 6, vary 
most often from one sentence to another. The two rising contours, 1 and 
3, vary on all three dimensions; the two falls, 4 and 6, are always very 
‘unpleasant’, but can be ‘authoritative’ or ‘submissive’, ‘strong’ or
‘weak’.

The more ‘lively’ the contour is, the more stable is its position in the 
‘semantic space’ over the different sentences. Contours nos. 8, 9 and 10, 
involving wide range and a change of direction, always occupy the same 
sector of the space, the ‘pleasant, authoritative, strong’ one. 

The less ‘interesting’ the intonation contour is, the more influential the 
sentence itself is in the judgement of the total effect, and vice versa. 




References
OSGOOD, C. E., SUCI, G. J., and TANNENBAUM, P. H. (1957), The Measurement of
Meaning, University of Illinois Press.
ULDALL, E. T. (1960), ‘Attitudinal meanings conveyed by intonation contours’,
Language and Speech, vol. 3, pp. 223-34.




Part Five
Intonation and Music 


The melody of speech is the point at which music and language meet.
A question that interests both musicians and linguists is how much one
depends on the other. In Western cultures music conveys mainly a 
dynamic and emotional message. The dynamic suggests some kinship 
with rhythmic work and the dance. The emotional may well be 
related to Western uses of pitch in language, which as the articles in the 
last preceding section show are heavily freighted with emotive meanings. 
George List's Reading approaches the question in a culture where it 


can be framed more precisely. Thai is a tone language. It distinguishes
three relatively level (‘register’) and two moving (‘contour’) tones,
distinctive for word meanings. When Thais sing are the melodic tones
of the music made to conform to the linguistic tones of the lyric? List's
answer is yes, in the main. The reader may entertain himself with some 
further questions: Is the music that accompanies a tone language more 
intellectual, or at least less emotive, than Western music? Is the 
difference between a music based at least partly on intonation and a
music based on tone one factor in making us feel among friends with
Hungarian or Finnish music and among strangers with the music of
China and the rest of the Orient where tone languages are spoken?

If there is any connection at all between emotion and music, one 
expects the music of Thailand to differ from that of the 
English-speaking world. But what about differences within the latter? 

Is it possible that even dialectal differences are reflected in music? 
American and British English are rather good for making a comparison, 
as there are some pretty characteristic differences in their intonational 
preferences. In the second Reading, Robert A. Hall, Jr points out these
differences and theorizes that they may account for the popularity of a 
typically British composer in Britain and his unpopularity elsewhere. 

What the first two Readings do is demonstrate that since both musical
melody and speech melody reside in the same human heads, they are
bound to affect each other in some way. How deep the influence may go
is another question: Thai tone is no more emotional, so far as we know,
than the distinctive sounds of the vowels, and aspects of intonation that
characterize one dialect as against another (specifically British and
American English) need not at the same time convey a mood; these
are more or less arbitrary associations. It is possible of course that
embedded in music these same contrasts may take on an emotional
tinge; but we ought to be able to put the question frontally. Fónagy
and Magdics attempt to do so in the third Reading. Where language
and music share the burden most intimately is in the direct transmission
of an emotional message. Given that both do this, does each do it in
its own way, or do they share the same means? The answer, to be valid,
must be cross-linguistic; Hungarian is chosen, and contrasted with
French, English and German. Not only are striking resemblances
discovered between the means used by language and music, but also
between Hungarian music and intonation and those of the three other
languages. Since these are Indo-European and Hungarian is not, there
has to have been either a wide sharing of culture or a close instinctive
tie between emotions and melodic curves that points to a common
origin.

A question remains, which is the extent to which musicians more or
less consciously put the emotive devices of language into music,
especially vocal music. The domain of music is that of all sound; a
composer or a folksinger can choose to imitate the bleat of a calf or a
roll of thunder. How much of the resemblance between intonation and
music is artificial and how much springs from a direct welling up of
joyous sound when one feels joyous, regardless of the medium?




14 George List 
Speech Melody and Song Melody in Central Thailand 


George List, ‘Speech melody and song melody in Central Thailand’, 
Ethnomusicology, vol. 5, no. 1, 1961, pp. 16-32. 


Thai or Siamese is a tone language. In a tone language the relative pitch 
at which a syllable is uttered or the inflection given to it may be phonemic, 
that is, may affect the meaning of the syllable. For example, the syllable 
kai may have one meaning if uttered at a relatively high pitch, another 
meaning if uttered at a relatively low pitch, and a third meaning if uttered
with an obvious downward inflection. 

Thai therefore has what may be termed ‘speech melody’. The intelligi- 
bility of any utterance of a speaker of Thai depends to a certain extent 
upon the accuracy with which he relates the pitch contour of this utterance 
with the pitch contours of the utterances surrounding it. In our Western 
culture we expect a coordination of speech accent and musical accent in 
song but we do not expect the melody of song to follow the intonation of 
our speech. In Thailand both speech and song have melodic contours. In 
what relationship do the two stand to each other? Is the musical melody 
subservient to that of the language or are the language contours modified 
to follow the musical contours? Must the contours of the music carefully 
follow those of speech so that complete intelligibility will be preserved or 
is the music free to construct its own melody independently of the melody 
of speech, the meaning of the language used then being understood by 
considerations of context? These are the questions to be considered in this 
article. 

It should be added in passing that the length of single vowels is also
phonemic in Thai. The same syllable uttered at a particular pitch or with
a particular inflection and containing a short vowel has a different meaning
than an otherwise similar monosyllabic utterance containing a long vowel.
Thus in Thailand speech melody and song melody have both pitch and
rhythmic characteristics. However, this discussion will be limited to the
problems of pitch relations between speech melody and song melody.
Temporal considerations will be omitted.

Thailand is divided into five cultural-geographical areas. The tones
used in the dialects spoken in these five areas differ considerably in number
and type. This analysis will be limited to the tonal system in use in Central
Thailand, the area surrounding Bangkok, the capital city. This Central
dialect is also the official language of Thailand.

There is no great unanimity of opinion among speakers of the Central 
Thai dialect or among scholars who have studied it as to the types of tones 
used. However, the tones are generally considered to be five in number. 
Of these three are register tones and are referred to as middle, low and high. 
The other two are contour tones, one forming a descending contour and
the other an ascending contour. The Thais use the terms just given to 
describe the register tones, middle, low and high, and these terms have 
approximately the same general connotations in Thai as they have in 
English. However, the Thais apparently have no commonly used terms 
to describe the contour tones (see Figure 1). 


Figure 1


Figure 1 presents the schematic organization of the tones used in the 
Central Thai dialect as given by Haas (1956). The first tone is a high register 
tone with a slight downward inflection; the second, a medial tone with
a very small downward inclination; and the third one low and flat. The
fourth - which the writer has designated falling — begins at the high 
level, holds this pitch momentarily, then slides down to the low level and 
also holds this pitch momentarily. The fifth tone — which has been desig- 
nated rising — presents a similar organization in inverse order. 

The second tonal scheme, presented in Figure 2, was given to the writer 
by Swat Sukontarangsi, a graduate student at Indiana University and a
native of Central Thailand. Swat has taught the Thai language to children
in the schools in Thailand and to speakers of English in the United States.
According to this informant this scheme is commonly used in teaching
Thai to speakers of English. In this form the three register tones are
presented purely as level or flat phenomena. The falling tone begins above
the middle tone but below the high tone and then slides downward
indeterminately. The initial and final points of the rising tone are also
indeterminate.


Figure 2

In Thailand, as in many parts of the Orient, song or chant is used as a
mnemonic device, the pupils using either chant or song in reciting their
lessons. Figure 3 is an excerpt from the beginning of a recitation of the
alphabet by a six-year-old pupil in the Bangkok schools. Although we may
consider this to be song, or, at least, chant, the Thais refer to this genre
as ‘recitation’. They do not consider it to be song. The full alphabet con-
sists of forty-four letters. This excerpt includes the first fourteen (see
Figure 3).

Figure 3 Recitation of the alphabet

The numbers above the staff in Figure 3 refer to syllables, each syllable
being held from one number until the following number. The actual sounds 
of the language have not been given since they are immaterial to this 
investigation. The capital letters found underneath the staff refer to the 
tones used in speech. 




Indiana University administers both Education and Government projects 
in Thailand and there are usually some fifty Thai graduate students 
registered at the University. Several of these students, all from the Central 
area, acted as informants in this study.¹ Each listened to the recordings
used in the analysis, wrote down the text of the item in transliteration or in
phonetic symbols and indicated beneath each syllable the tone he or she 
would use when speaking this syllable. Each item was transcribed in this 
manner by two to four informants. The informants did not always concur 
on the tone that should be used in speaking a particular syllable but the 
syllables whose tones were in doubt formed only 7 per cent of the total 
material analysed. 

In Figure 3, three such conflicts of opinion may be noted. These are 
indicated under syllables 12, 23 and 25. Three informants transcribed this 
item. Where two letters are found under a syllable the upper letter repre- 
sents the opinion of two informants, the lower the opinion of one. 

This little chant seems to follow quite closely the organization of tones 
given in Figure 1. Only three pitches are used. High is represented by C♯,
as in syllable 21; low by A, as in syllable 2; and the mid tone by B, as in
syllable 7. The falling tone is represented by the pattern, C♯–A, as in
syllable 23; and the rising tone by the inversion of this pattern, as in
syllable 3.

Where the informants differ, as in syllable 23, here associated with a
falling tone, the majority opinion is accepted and the minority opinion
rejected. This procedure will be followed throughout the analysis. Syllables
12, 23 and 25 are therefore considered to be associated with the tone indi-
cated by the upper of the two letters found below the staff.
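
The majority procedure can be stated compactly. In the sketch below the function name and the handling of ties are assumptions introduced for illustration; the text says only that the majority opinion is accepted, and does not say what was done when informants were evenly divided.

```python
from collections import Counter


def majority_tone(opinions):
    """Return the tone named by most informants, or None if there is a tie."""
    if not opinions:
        raise ValueError("at least one opinion is required")
    counts = Counter(opinions).most_common()
    if len(counts) > 1 and counts[0][1] == counts[1][1]:
        return None  # evenly divided; the text does not cover this case
    return counts[0][0]


# Three informants, two of whom agree (an invented illustration).
print(majority_tone(["falling", "falling", "low"]))
```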

In order to determine the degree of coordination found between the
speech melody and the song melody in this example we shall compare the
pitch level of speech and song of each syllable with that of the preceding
syllable and with that of the following syllable. This comparison
can be made with register tones only, since different opinions exist as to the
relation of the initial and final points of the contour tones, falling and
rising, to the three register tones and to each other. Following this method
we see that although syllable 1 in Figure 3 is recited on the pitch C♯ and the
greater portion of the other mid tones in the example are recited
on the pitch B, syllable 1 is in proper coordination with syllable 2 since
C♯, representing the mid tone, is higher in pitch than A, representing the
low tone. The pitch of the tones is relative rather than fixed.


1. The informants were Kingkeo Atta; А it 
Sat ӨК renes and Kauds Thé Hace Семь Chaiyaratana, Sakon Changsant 






There is no syllable to consider preceding syllable 1. Syllable 2 is in 
proper relation to the preceding syllable 1. Syllable 3 cannot be con- 
sidered since it is a contour tone. No judgement can be made concerning 
syllable 4 since it is preceded and followed by a contour tone. Syllable 7,
on the other hand, can be compared with both the preceding syllable 6 
and the following syllable 8 and coordinates with both. 
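
The comparison of each register-tone syllable with its neighbours can be sketched as below. This is an illustration under stated assumptions, not the author's own procedure: pitches are given as numbers (for instance semitones above the lowest pitch), two adjacent syllables are taken to coordinate when the direction of the step in song matches the direction of the step between their speech tones, and all names and the sample data are invented.

```python
LEVEL = {"low": 0, "mid": 1, "high": 2}  # register tones only


def sign(x):
    """Return -1, 0 or 1 according to the sign of x."""
    return (x > 0) - (x < 0)


def coordination(tones, pitches):
    """Judge each syllable against its neighbours.

    tones   -- speech tone of each syllable ('low', 'mid', 'high',
               'rising' or 'falling')
    pitches -- sung pitch of each syllable as a number
    Returns a list of True (coordinates), False (does not) or None
    (cannot be judged by this method).
    """
    results = []
    for i, tone in enumerate(tones):
        if tone not in LEVEL:
            results.append(None)  # contour tone: judged by direction instead
            continue
        verdicts = []
        for j in (i - 1, i + 1):
            if 0 <= j < len(tones) and tones[j] in LEVEL:
                speech = sign(LEVEL[tone] - LEVEL[tones[j]])
                song = sign(pitches[i] - pitches[j])
                verdicts.append(speech == song)
        results.append(all(verdicts) if verdicts else None)
    return results


# Invented five-syllable example; pitches in semitones above A.
tones = ["mid", "low", "rising", "mid", "high"]
pitches = [4, 0, 2, 2, 4]
print(coordination(tones, pitches))
```

Because only the direction of each step is compared, a mid tone sung on C♯ before a low tone sung on A counts as coordinated even if other mid tones are sung on B.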

Following this method we discover two sequences of non-conforming 
or non-coordinating syllables, 11-12-13 and 24-25-26. 

Since opinions differ concerning the levels at which the contour tones 
begin and end we can judge their coordination with the musical contours 
only from the point of view of the direction in which they move. Since the 
three rising tones in this example are associated with an ascending musical 
pattern, and the one falling tone with a descending pattern, we judge them
to conform.

We may thus conclude that of the twenty-eight syllables found in this 
excerpt, six do not exhibit full coordination of speech melody and song 
melody according to the method of analysis used. The degree of coordina- 
tion exhibited is therefore approximately 79 per cent. 
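
The arithmetic behind this figure is simply the share of conforming syllables in the total. A minimal check, with a function name chosen here for illustration:

```python
def coordination_percentage(total, non_conforming):
    """Share of syllables showing coordination, as a percentage."""
    if total <= 0 or not 0 <= non_conforming <= total:
        raise ValueError("counts out of range")
    return 100 * (total - non_conforming) / total


# Alphabet excerpt: 28 syllables, 6 of them not fully coordinated.
print(round(coordination_percentage(28, 6)))  # prints 79
```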

Figure 4 is a transcription of a recording of a recitation of the multipli-
cation table by a group of children ages twelve to thirteen in the Bangkok
schools. The excerpt covers two times one through two times twelve. Note
that throughout this excerpt the single and level pitch, F♯, in the song
melody is associated with the tone contour, rising. The single or level
pitch, C♯, is associated with tone contour, falling, in two instances,
syllables 25 and 41, and partially in a third, syllable 48.

After each informant had transcribed the text of each chant or song he
or she was recorded speaking this text. In the spoken versions of this reci-
tation the rising and falling aspects of the contours are carefully observed.
According to the informants, in rapid conversation or recitation the high
register tone is commonly substituted for the rising contour tone since the
latter is the most difficult of the five tones to produce. There seems to be
a similar substitution of the low register tone for the falling contour tone
although this substitution does not seem to occur with equal frequency.
Meaning is apparently then grasped by context. We must therefore assume
that there are two acceptable forms of the rising tone and two of the
falling tone.


Figure 4 Recitation of the multiplication table

One informant, Miss Chalao Chaiyaratana, has given the writer a more
detailed analysis of the manner in which this substitution is structured.
According to her observations, it is made only when the syllable has no
assigned meaning when uttered with the substituted tone. Thus, the first
syllable, song, has meaning when associated with the rising tone but not
with the high tone. The high tone may therefore be substituted and the
meaning associated with the rising tone transferred to the previously
meaningless utterance at the high tone. Syllable 36, yi, has meaning if
associated with a falling tone, none if associated with a low tone. The low
tone may therefore be substituted for the falling tone. These are apparently
the common substitutions. This informant knows no instances, for
example, of the substitution of the mid tone for the rising tone.
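
The condition reported by this informant amounts to a lookup: a register tone may stand in for a contour tone only if the syllable means nothing with the register tone. The sketch below is illustrative only; the tiny lexicon holds just the two cases mentioned in the text, and the names `MEANINGFUL`, `SUBSTITUTE` and `may_substitute` are introduced here.

```python
# Hypothetical lexicon: the (syllable, tone) pairs that carry a meaning.
# Only the two entries below are taken from the text; glosses are omitted.
MEANINGFUL = {("song", "rising"), ("yi", "falling")}

# Substitutions reported as common: contour tone -> register tone.
SUBSTITUTE = {"rising": "high", "falling": "low"}


def may_substitute(syllable, tone, lexicon=MEANINGFUL):
    """True if the register tone may replace the contour tone of a syllable."""
    if tone not in SUBSTITUTE:
        return False  # only the two contour tones are replaced
    # Allowed only when the syllable means nothing with the substituted tone.
    return (syllable, SUBSTITUTE[tone]) not in lexicon


print(may_substitute("song", "rising"))
print(may_substitute("yi", "falling"))
```

If the lexicon also contained a meaningful low-tone yi, the same call for yi would return False, which is the restriction the informant describes.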

Further evidence of the substitution of the low tone for the falling tone
will be seen in the last phrase of the recitation. Here the children differ in
their interpretation of syllable 48, yi. It is recited by part of the group in
the manner of syllables 14 and 29 and by the remainder in that of syllables
35 and 41.

Having accepted the substitution of the high and low register tones for
the rising and falling contour tones as a characteristic of Thai recitation,
and possibly speech, we find only two syllable sequences in this excerpt
which do not exhibit coordination of speech melody and song melody.
These are syllables 33-4 and 39-40. In these a change of tone from low to
mid does not produce a corresponding change of musical pitch. Adding
these to syllable 48, in which the reciters themselves are inconsistent in the
use of song pitches, we have five syllables out of the total of fifty showing
lack of coordination. This excerpt therefore exhibits a 90 per cent coordi-
nation of speech melody and song melody.

The next transcription is also of a recording of a recitation in the Bang- 
kok schools. The reciters were two young men in their late teens. Their 
recitation is of a series of poems in a rather complex classical form known 
as klong. In this case each poem contains only one stanza. The excerpt
consists of one full stanza.

Figure 5 Literary recitation, klong

In Figure 5 note the downward glides from high tones in syllables 5,
9, 15 and 17. The glide at 15 is much more pronounced than the other
three. This glide, at least, is not related to the necessities of speech tones.
It is a form of intonation used as emphasis. Thus, intonation exists to some
degree in Thai in addition to tone. The only other type of intonation
apparently in common use is that of gradually dropping the general pitch
level throughout a phrase or sentence and then returning to the original
level as the next phrase is begun. This intonational practice does not seem
to have affected the chants or songs analysed.

In discussing the relation of tones and intonation to chant and song in
Chinese, Chao (1957) makes a distinction between ‘chant’ and ‘sing-song’.²
The term ‘sing-song’ is applied to a form used by vendors and by children
in the lower grades in China and is characterized as a stereotyped form of
speech or recitation in which intonation plays no part. On the other hand,
chanting of literary poems in the upper grades (a practice now dying out)
apparently takes intonational phenomena into consideration. Although
both Chinese ‘sing-song’ and ‘chant’ are improvised, only the latter
requires training for proper rendition.

2. Chao's discussion is principally concerned with the Mandarin dialect which has
four tones.

Since in the distant past the Thais migrated south from China and since 
some Thai groups are still found in southern China it is not surprising to 
find practices parallel to those of China. According to the Thai informants 
assisting in this study, the type of recitation used in the alphabet and the 
multiplication table is purely an aural tradition. The children receive no
instruction in this type of recitation. The recitation of the klong, on the
other hand, is taught in the schools. 

In Figure 5 we find once more, in syllable 25, the substitution of the 
high tone for the rising tone, or, to state the matter more accurately, the 
use of the highest pitch of the song melody in association with the rising 
tone of the speech melody. Note that in both Figures 4 and 5 the highest 
pitch of the song melody is substituted for the rising tone. When a low 
tone is substituted for the falling tone, as in Figure 4, this low tone is the 
lowest musical pitch found in the chant melody. 

The next example is an excerpt from traditional Thai classical song.

Figure 6 Song in classical style, ‘Mountain Breeze’


This excerpt illustrates a phenomenon common to Thai song, the use of
the nonsense syllable, ey, as a filler between sections of text. In the
classical song continuants are also used. In this usage the enunciation of a
syllable is completed and the singer continues for a brief period on the
nasal, ng, before enunciating the second syllable. In Figure 6 nonsense


syllables are indicated by a lower case [...]

The text of this [...] have been set to the same musical melody. The tune
also exists independently of text and is used as the basis of instrumental
improvisation.
Except in the case of the modern popular songs, the Thais claim not to 
compose new melodies but to utilize a stock of traditional tunes already 
in existence. They find no need to develop new melodies since the stock 
of pre-existing tunes seems quite adequate for their purposes. In setting a 
text to a pre-existing melody an attempt is made to select speech syllables 
whose tones coordinate with the contours of the musical melody. A num- 
ber of Thais questioned over a period of several years have made state- 
ments similar to the above. This is apparently a fairly stable cultural point 
of view. 

This approach to composition offers some parallels with traditional
methods of composition used in China (Levis, 1937). Chinese musicians
first composed their melodies as a generalized series of rising and falling
contours plus reiterated pitches, using a special notation for this purpose.
Words were then set to the composed melody in such a manner that speech
and musical contours coordinated. Specific musical pitches were later
selected from the various Chinese classical modes.

It is difficult to determine the degree of coordination of speech melody
and song melody existing in this excerpt since the register tones are
separated not only by contour tones but by nonsense syllables and
continuants. Since the last two are meaningless, there are obviously no specific
tones associated with them. The first five syllables coordinate well.
Syllables 7 and 8, if not separated by a continuant, would certainly not
conform. The mid tones of syllables 9 and 10 are separated by approximately
a quarter tone. Later in the song are found falling tones associated
with a descending and rising melodic arc and rising tones associated with
the reverse contour. Since the poem is set to a pre-existing melody it is not
to be expected that the degree of coordination would be very high.

In pursuing this study it seemed useful to secure some data possibly 
more objective than those established by the human ear, specifically, 
spectrographic evidence. Professor Fred Householder of the Indiana 
University Linguistics Program was kind enough to offer assistance in 
this direction by preparing spectrograph analyses of both the music and 
the spoken text of a short Thai lullaby. The informant learned this lullaby 
from her nurse, who had no formal education. Other Thais remember 
having heard the same lullaby as children but in varied forms. It can thus 
be assumed that this lullaby is a traditional folksong in the commonly
accepted meaning of the term (see Figure 7).

Figure 7 Lullaby, ‘The Mother Crow’

Figure 7 shows a short section of the transcription of the song melody
in musical notation, the spectrographic transcription of the same section of
the song, and the spectrographic transcript of the same section of the text
when spoken rather than sung. The nonsense syllable, ey, is also used in
this lullaby. It occurs at the end of this musical phrase but here has been
elided. The nonsense syllable is not used in speech and therefore does not
occur in the spoken version of the text.

Although the graphs of speech and song are remarkably alike through- 
out the lullaby, two principal differences can be noted. Rises are much 
more abrupt in song than in speech and the range of speech is very much 
greater than that of song. Each set of four short lines to the left of the 
song and speech graphs represents a range of an octave, divided into three
approximate major thirds. The ranges of the song phrase and of the speech 
phrase as shown in the graphs in Figure 7 are as follows: 


          Lowest pitch    Highest pitch
Song      [illegible]     [illegible]
Speech    [illegible]     [illegible]


The range of the song phrase is therefore a perfect fifth and that of the
speech phrase a minor sixth. The total range throughout the song and
spoken text shows a greater contrast.

          Lowest pitch    Highest pitch
Song      f               e♭¹
Speech    A               e♭¹

The total range of the song is thus a minor seventh while that of the
spoken text is an octave plus a diminished fifth.
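
The interval arithmetic behind these total ranges can be checked mechanically. The following is only a minimal sketch in Python and is not part of the method described in the text. It assumes that the pitches are written in Helmholtz octave naming (A in the great octave, f in the small octave, e♭¹ in the one-line octave); the tuple encoding and the function name `interval` are illustrative choices.

```python
# Pitches are encoded as (letter, accidental, octave) with Helmholtz-style octaves:
# -1 = great octave (written "A"), 0 = small octave ("f"), 1 = one-line ("e♭¹").
# This encoding is an assumption made for the sketch, not the author's notation.
LETTERS = "cdefgab"
SEMITONES = {"c": 0, "d": 2, "e": 4, "f": 5, "g": 7, "a": 9, "b": 11}

# (letter steps, semitones) -> interval name, for intervals smaller than an octave.
NAMES = {
    (0, 0): "unison",
    (1, 1): "minor second", (1, 2): "major second",
    (2, 3): "minor third", (2, 4): "major third",
    (3, 5): "perfect fourth", (3, 6): "augmented fourth",
    (4, 6): "diminished fifth", (4, 7): "perfect fifth",
    (5, 8): "minor sixth", (5, 9): "major sixth",
    (6, 10): "minor seventh", (6, 11): "major seventh",
}


def interval(low, high):
    """Return (whole octaves, name of the remaining simple interval)."""
    def position(pitch):
        letter, accidental, octave = pitch
        steps = octave * 7 + LETTERS.index(letter)
        semis = octave * 12 + SEMITONES[letter] + accidental
        return steps, semis

    low_steps, low_semis = position(low)
    high_steps, high_semis = position(high)
    octaves, simple_steps = divmod(high_steps - low_steps, 7)
    simple_semis = (high_semis - low_semis) - 12 * octaves
    return octaves, NAMES.get((simple_steps, simple_semis), "unnamed interval")


SONG_LOW = ("f", 0, 0)      # f, small octave
SPEECH_LOW = ("a", 0, -1)   # A, great octave
HIGHEST = ("e", -1, 1)      # e♭¹, one-line octave

print(interval(SONG_LOW, HIGHEST))    # song range
print(interval(SPEECH_LOW, HIGHEST))  # speech range
```

Counting letter steps as well as semitones is what lets the sketch distinguish a diminished fifth from an augmented fourth, which a bare semitone count could not do.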
Again using the spectrograph Householder developed a schematic
organization of the tones used in Central Thailand. These are shown in
Figure 8.

Figure 8

These graphs are averages of contours shown in a minimum of ten
utterances and a maximum of forty. They are based on syllables both
uttered in isolation and in context. The short lines to the left again
represent the range of an octave, but divided into four minor thirds.

The schematic organization of tones based on spectrographic evidence
differs considerably from that offered by Haas, Figure 1, or by Swat,
Figure 2. The spectrograph shows that none of the register tones, mid, low,
nor high, are level. Mid and low move down, high ascends. The difference
in relative pitch between mid and low is quite small. The rising tone, it
appears, does not actually ascend but dips and returns to approximately
the same point. It is thus apparent why the Thais feel that the rising tone is
the most difficult of the tones to produce.

Householder found two distinct forms of the falling tone, one a descending
contour and the other, the only level tone of the group, appearing at
the highest pitch. Thus there seem to be three forms of the falling tone in
use, a descending contour, a low supposedly level tone, and a very high
tone.
Should the initial half of the rising tone contour in Figure 8 be elided,
the remaining part of the contour somewhat resembles the high tone. A
similar elision of the first half of the falling tone would in turn produce a
contour somewhat resembling the low tone. This may perhaps explain the
use of high and low tones in rapid conversation and recitation to represent
the rising and falling tones.

Further information concerning the contours of the rising and falling
tones can be gathered from the complete graphs of the song and spoken
text of this lullaby. The rising tone occurs four times in the song. In both
speech and song the rising tone three times exhibits the dip and return
contour shown in the spectrographic analysis of the individual tones. Once,
in both song and speech, it takes the form of a rising contour without the
initial dip. However, song and speech are not consistent in usage. In
syllable 11 the speech shows the dip and return while the song shows an
ascending contour only. In syllable 15 this relationship is reversed.

The falling tone occurs eleven times in the lullaby. In speech, almost
without exception, there is at least a slight rise before the fall. In song
this preliminary rise occurs less frequently. The descent in song is much
more abrupt than in speech. The one example of the use of the falling
tone as a very high level tone appears in both speech and song and is
associated with syllable 15 in both cases. This is shown in Figure 9.

From this example it would seem that the falling tone when preceding
another falling tone may take on the character of a high level tone.
However, the matter is obviously not this simple. Two falling tones also appear
in succession in Figure 7. Both are associated with descending contours.

The fact that the range of speech is larger than that of song is shown
more clearly in Figure 9 than in Figure 7.

Two quite different methods can be applied in studying the degree of
coordination of speech melody and song melody. The first would involve
making spectrographic analyses of the recitations and song, and of their
spoken texts, and comparing the graphs thus produced. The second would
involve the transcription of the texts by the informants, transcription of
the music in notation by the scholar, and the comparison of these results
arrived at purely by ear. The results achieved by the first method would be
acoustically accurate but not necessarily culturally valid. In the first place,
it has been established that the human mind and ear make distinctions
where the spectrograph makes none (Hockett, 1955). The reverse is of course
obviously true. In the second place, although the Thai informants used in
the analysis have a high level of education, they have in most cases only the
[...] contours and relative pitches of the [...]al pattern
includes three register tones [...] one descending and the other ascending.
[...] rising tone is difficult to produce there is apparently no clear mental
definition of its shape. This lack of a detailed awareness on the part of the
informants of the mechanisms of their speech is not at all surprising. It is
doubtful if many reading this article can make an accurate and detailed
phonetic analysis of their speech habits in their own tongue. Is the reader,
for example, aware of the circumstances under which he aspirates a ‘p’,
and the circumstances under which he pronounces it without aspiration?

Again, it is impossible for the musicologist either to hear or to indicate
in notation all the subtle differences in pitch which are registered by the
spectrograph. Musical notation is, in itself, a generalization.

When a Thai utters what he considers to be a rising tone, he believes,
he assumes, that he is producing an ascending contour. Whether this is the
exact contour the tone takes acoustically is immaterial. When this syllable,
using this tone, is sung to an ascending contour in song, the Thai has to his
satisfaction achieved coordination of speech melody and song melody.
And when the musicologist's transcription is sufficiently accurate to
indicate the presence or lack of this type of coordination it suffices for his
analysis.

Figure 10 is a short lullaby, sung by the same informant and learned 
under the same circumstances. 


Figure 10 Lullaby, ‘Boat and Rain’


Like many lullabies around the world this one has a refrain sung to
nonsense syllables. In Thailand a type of hammock is often used as a
cradle and the child is rocked by rhythmically pulling on strings attached to
the cradle. This practice explains the rhythmic and almost metrical
character of the song. It will be noticed that the refrain of nonsense syllables
is organized sequentially but not the meaningful text. The sequences and the
metrical organization may perhaps suggest Western influence but this little
song shows perfect coordination of speech melody and song melody
according to the method of analysis being applied.

The next example is a popular song of the present day which is based
on a traditional classical melody. The informant who sang this song had
heard it sung by her friends and on phonograph records issued in Thailand.
The full song contains 29 syllables.

Figure 11 Popular song based on a traditional classical melody

As will be noted in Figure 11, there were no differences of opinion among
the three informants who transcribed this song as to the tones associated
with the 29 syllables found in the text. Not only are the words set to a
traditional tune but nonsense syllables are used to fill the musical phrase.
However, coordination of speech melody and song melody is not nearly as
high as in the recitations and the lullabies. Lack of coordination, for
example, can be seen in Figure 11 in syllable sequences 4-5, 7-8-9 and 12-13.

Our last example is a highly acculturated popular song. The recording
analysed is a copy of an unidentified commercial disc issued in Thailand.
The song is sung by a female vocalist accompanied by a small orchestra of
Western musical instruments and is cast in the common American popular
song form of 32 measures of common time divided into four sections, A¹,
A², B and A². Figure 12 represents the first eight measures, or A¹.


A detailed analysis was made of the entire 124 syllables heard in this
[...] speech melody and song melody, as far as the register [...] In
Section A¹, shown in Figure 12, [...]

Figure 12 Acculturated popular song

[...] associated pitches occurring in this song shows that the high tone is
most frequently associated with the pitch A, the low tone with the pitch F, and
the mid tone with the pitch G. On this basis we accept the association of the
pitch F with the falling tone and the pitch A with the rising or falling tone as
proper coordination of speech melody and song melody.
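
A catalogue of tones and their associated pitches of the kind just described amounts to a frequency tally. The following Python sketch shows one way such a tally might be made. The observations listed are invented purely for illustration and are not data from the song, and the name `most_common_pitch` is hypothetical.

```python
from collections import Counter, defaultdict

# Invented (speech tone, sung pitch) pairs, for illustration only.
observations = [
    ("high", "A"), ("high", "A"), ("high", "G"),
    ("mid", "G"), ("mid", "G"), ("mid", "A"),
    ("low", "F"), ("low", "F"), ("low", "G"),
]


def most_common_pitch(pairs):
    """Map each speech tone to the pitch on which it is most often sung."""
    tallies = defaultdict(Counter)
    for tone, pitch in pairs:
        tallies[tone][pitch] += 1
    # most_common(1) returns a one-element list of (pitch, count).
    return {tone: counts.most_common(1)[0][0] for tone, counts in tallies.items()}


print(most_common_pitch(observations))
```

With real transcriptions in place of the invented pairs, the resulting mapping would give the reference pitch against which each register tone is judged.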

The contour tones in this song show much more coordination than the 
register tones. This will be noted in syllables 4, 10, 14 and 17 in Figure 12. 


Contour tones found at cadence points seem to be coordinated with 
particular care (see Figure 13). 


Figure 13 Acculturated popular song


The handling of the contour tones in the final cadences of A², syllables
59 and 124, is shown in Figure 13. Also the coordination at the [...]
formula of the B section, syllables 89 through 94, is shown. In general,
traditional Thai music seems to be cast in non-repetitive musical [...]. This
may be a reflection of the strong influence exerted on music by the [...]
organization of the language. Language does not ordinarily fall into evenly
balanced, repetitive groups. The triple repetition of the A section in this
song would certainly tend to prevent perfect coordination of tones and
musical contours. This is shown in Figure 13 in the tones associated with
the melodic figure sung to syllables 17, 46 and 111.

The observations made may now be summarized. It has become obvious
that not all genres of Thai song or recitation show the same degree of
coordination of speech melody and song melody. Eight examples were
analysed. The degree of coordination shown in each as determined by the
methods of analysis used is shown below:

Percentage   Item
100          VI     Lullaby, ‘Boat and Rain’
90           II     Recitation of the Multiplication Table
79           I      Recitation of the Alphabet
78           III    Literary Recitation, Klong
78           V      Lullaby, ‘The Mother Crow’
66           IV     Song in Classical Style, ‘Mountain Breeze’
60           VIII   Acculturated Popular Song
59           VII    Popular Song Based on a Traditional Classical Melody

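
A percentage of this kind is presumably the share of syllables judged to be coordinated, although the arithmetic is not spelled out in the text. The Python sketch below works under that assumption. The per-syllable judgments are invented for illustration, and `coordination_percentage` is a hypothetical helper rather than the author's procedure.

```python
def coordination_percentage(judgments):
    """Percentage of syllables judged coordinated, rounded to a whole number."""
    if not judgments:
        raise ValueError("no syllables to evaluate")
    # True counts as 1 and False as 0, so sum() gives the coordinated count.
    return round(100 * sum(judgments) / len(judgments))


# Invented example: a 29-syllable text with 17 syllables judged coordinated.
judgments = [True] * 17 + [False] * 12
print(coordination_percentage(judgments))
```

The count of 17 is an assumption chosen for the example, not a figure taken from the analysis.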
We can conclude, therefore, that in the central Thai culture there is a
high level of coordination of the pitch elements of speech and song in the
recitations or chants used in the public schools, whether traditional or
improvised, and in folk songs such as lullabies where text and tune probably
form one associated tradition. Less coordination is found in the classical
song where the association of pre-composed texts and tunes presents much
difficulty in achieving this coordination and where the frequent use of
nonsense syllables and continuants tends to fragment the verbal phrase. Still
less coordination is found in the present-day popular song, whether an
acculturated imitation of a Western model or a setting of a new text to a
traditional tune.

In answer to the questions posed at the beginning of this article, we
can now conclude that in chant and song found in the traditional everyday
life of the people of central Thailand, speech melody has played the most
prominent role. Song melody has been subservient, and purely musical
creativity operated within a small and limited sphere. In the artistic,
aristocratic classical song musical creativity played a much greater role, utilizing
meaningless syllables and continuants as a basis for this musical
elaboration. As the imitation of Western styles has spread throughout the culture,
coordination of register tones with the musical contours has tended to
diminish in degree but the influence of the contour tones upon the musical
line seems to have retained the greater part of its force.

The persistence of this last phenomenon in the highly acculturated
popular song, as shown in Figures 12 and 13, is an excellent example of the
strength and stability often shown by a largely unconscious cultural trait
during the acculturative process. This cultural trait is apparently very
strong among the singers of Central Thailand.


Musical notation in figures 


The music is written an octave higher than heard.

Two slurs, one above another, indicate a vocal glide or portamento. When 
the glide is found between two notes it is sung on the syllable of the first. 
When there is no note following the glide, the approximate pitch at which it 
ends is indicated by a grace note in parentheses. A glide into a note is 
indicated by a slanting arrow preceding the note and is sung on the syllable 
of the note. 

A vertical arrow pointing upwards indicates that the note over which it is 
placed is sharp, but not more than a quarter-step sharp. A similarly placed 
arrow pointing downwards indicates that the note is flat, but not more than 
a quarter-step flat.


Wavy vertical lines indicate that the material placed between them is 
excerpted from context. 


References 


CHAO, Y. R. (1956), ‘Tones, intonation, singsong, chanting, recitative, tonal
composition and atonal composition in Chinese’, For Roman Jakobson, Mouton,
pp. 52-9.

HAAS, M. (1956), The Thai System of Writing, American Council of Learned
Societies, Program in Oriental Languages, Publication Series B – Aids – No. 5,
Washington.

HOCKETT, C. F. (1955), A Manual of Phonology, Indiana University Publications
in Anthropology and Linguistics, Memoir 11, International Journal of American
Linguistics, Bloomington, Chapter 5.

LEVIS, J. H. (1937), ‘Chinese music’, Asia, vol. 37, December, pp. 864-5.


Recordings 


1. Recitation of the Alphabet, recorded by Howard K. Kaufman in Bangkok,
1953-4. Indiana University Archives of Folk and Primitive Music Tape No. 819.2.

2. Recitation of the Multiplication Table, recorded by Howard K. Kaufman in
Bangkok, 1953-4. IU AFPM Tape No. 824.7.

3. Literary Recitation, Klong, recorded by Howard K. Kaufman in Bangkok,
1953-4. IU AFPM Tape No. 816.5.

4. Song in Classical Style, Mountain Breeze, sung by Nang Ootoomporn Uttara.
Recorded by Priscilla V. Magdamo, Bloomington, Indiana, May, 1959, as part of
class project, Music U 302, ‘Recording and Transcription Techniques in the Study
of Folk Music’.

5. Lullaby, The Mother Crow, sung and spoken by Kanda Thammongkol. From
sound track of television film, Music and Infancy. Recorded by Indiana
University Radio and Television Services, Bloomington, 1958.

6. Lullaby, The Boat and Rain, sung by Kanda Thammongkol. Recorded by
George List, Bloomington, Indiana, December, 1959.

7. Popular Song Based on a Traditional Classical Melody, sung by Kanda
Thammongkol. Recorded by George List, Bloomington, Indiana, December,
1959.

8. Acculturated Popular Song, copied in Bangkok from an unidentified commercial
disc issued in Thailand, by Howard K. Kaufman. IU AFPM Tape No. 817.8.


15 Robert A. Hall, Jr


Elgar and the Intonation of British English


Robert A. Hall, Jr, ‘Elgar and the intonation of British English’, Gramophone,
vol. 31, June, 1953, p. 6.


Elgar is very popular in England, and not at all popular elsewhere; in this 
respect, his case is similar to that of Fauré in France, or of Bruckner and 
Mahler in Austria. We have known this much for a long time; but why 
should it be so? On this point we have been offered various explanations, 
but all couched in terms of generalities and all basically unsatisfactory. 
During the 1920s and 1930s, a natural reaction against the late Victorian and 
Edwardian periods caused a certain amount of hostility to the more super- 
ficial aspects of Elgar’s inspiration, which were identified with imperialistic 
bombast and with the overstuffed ‘plushy’ style of the turn of the century;
but that attitude of disdain is rapidly passing, and Elgar is no more popular 
outside of England than before. Eric Blom, in the last chapter of W. H. 
Reed’s Elgar (1959), suggests that it is ‘likely to be merely a matter of an 
ancient tradition according to which “English music” is a contradiction in 
terms’; and yet, even nowadays, when Britten and Rubbra and Vaughan
Williams are well liked abroad, Elgar’s music remains unpopular. Ernest
Newman suggested ignorance of English culture as a cause: ‘There is 
something in this most English of all composers that escapes all foreigners, 
no doubt because they have an insufficient acquaintance with the thousand 
years of culture and tradition out of which the mind of Elgar has flowered’ 
(quoted by Porte, 1933, p. 98). Yet Verdi is beloved of many who know 
nothing of Italy, and the great German composers from Bach to Brahms 
are popular the world over. We are left with the vague feeling that Elgar is 
inexplicably unique, and that we can apply to all of his work what Daniel 
Gregory Mason said apropos of the ‘Nimrod’ variation (1918): ‘It is a
striking fact that the originality of the passage (for no one but Elgar could 
have written it) is due to subtle, almost unanalysable qualities in the mode 
of composition rather than to any unusual features of style.’ 

Can we find any more precise explanation than these ‘subtle, almost
unanalysable qualities’ for Elgar's great popularity at home and unpopularity
abroad? Purely musical analysis and attempted correlations with general
cultural phenomena have not succeeded so far; perhaps we should look
farther, in previously unexplored fields. One such area, in which relatively
little work has been done to date, is that of intonation patterns in language.
(We use the term intonation here in its linguistic sense, referring to
speech-melody, the rise and fall of the voice in connected utterance; the term
inflection is sometimes also used in this meaning, but linguistic analysts
prefer to reserve inflection for grammatical variations of the type man, man’s,
men, men’s or am, is, are.) Ordinarily, in discussing language or thinking
about it, we neglect intonation, because we take it for granted. There is a
good reason for this, because intonation is perhaps the most deep-rooted
and the least conscious of all aspects of linguistic behaviour. We learn the
intonation patterns of our native language earlier, even, than its individual
sounds, words or syntax; and when we learn a foreign language, we
unlearn our native intonation with more difficulty than any other feature of
our speech. There are differences in intonation, not only between languages
but also between dialects of the same language (just listen to a Londoner
and then to a Scotsman, or to a New Yorker and then a Texan).

Two of the most striking features of British-English intonation, which
distinguish it from American English as well as from most European
languages, are a wide range of variation in pitch and a predominance of
falling patterns. The normal American's range of pitch is relatively narrow,
as contrasted with that of British English; this is what gives the Britisher
the impression that the American is speaking ‘in a monotone’, whereas the
American thinks the Britisher is ‘singing’ rather than speaking normally.
A falling pitch, from relatively high to relatively low, characterizes the end
of a declarative sentence in both British and American English, and also a
question beginning with an interrogative word, e.g. Where are you going?
But in questions not beginning with an interrogative (e.g. Are you coming?)
American English and most European languages use a sharply rising
intonation, whereas British English has the same falling pitch that it has in
Where are you going? (cf. Jones, 1964; Palmer, 1922). As a result, the
British pronunciation of Are you coming?, say, sounds decidedly strange and
foreign to American and Continental ears; furthermore, the statistical
predominance of the falling pitch pattern in British English is increased by its
use in this type of sentence.

Now let us turn to Elgar's music and see if it corresponds in any respect
to these characteristics of British-English pitch patterns. We immediately
notice that, as many observers have commented, Elgar's melodic line, in
Mason's words (1918), ‘shows a tendency to large leaps, often of a seventh,
in alternating directions, giving its line a sharply serrated profile’. These
leaps correspond exactly to the wide range of pitch variation in
British-English intonation. Furthermore, we notice that a great many of his
themes show a predominantly falling trend; think, for instance, of the main
motives of Falstaff, the introductory theme of the Introduction and Allegro,
the first subject of the Second Symphony, and a host of others. Even more
significant is what Elgar does in working with material whose compass is 
of a more limited range, such as the ‘Welsh’ theme in the Introduction and 
Allegro. Here, as is well known, he was using a reminiscence of a tune he 
had heard in Wales, involving a drop of a minor third. He starts with his 
tune restricted to that interval; but as soon as he begins to develop it 
further, he goes off into his customary leaps and falling trend. 

But there is even more direct evidence of the influence of speech patterns 
on Elgar's melodic invention. Reed tells, in his book on Elgar (1959, p. 75), 
how 


he had a little habit of repeating some particular word or phrase that had taken 
his fancy. ... The name of a place would please him in some way, and not 
content with repeating the word continually, he would set it to music, as for 
instance Moglio, the name of a village quite near to him when he was writing In 
the South. . . . Needless to say, the bars in the score of In the South marked with 
the word ‘Moglio’ are repeated in the music many times, just as he would keep 
saying it. 


Reed's quotation from the score shows, furthermore, the characteristic 
downward curve of normal British-English intonation. 

No wonder, then, that the English feel there is something peculiarly ‘all
their own’ about Elgar, which the non-English fail to appreciate. According
to our hypothesis, the phenomenon is due, at least in part, to his reflecting 
in his music the two most characteristic features of British-English 
intonation, its wide pitch range and its predominantly falling patterns. 
Since, however, we normally have a very hard time sorting out or even 
identifying features of intonation, the Englishman simply feels an ‘in- 
stinctive’ affinity to Elgar’s music, and the non-Englishman feels its 
‘strangeness’, both of them without knowing why. Our hypothesis, moreover,
would give much fuller content to Elgar’s somewhat mystifying remark
that ‘music was in the air all around you and that you merely had to
grab what you wanted and as much as you wanted’ (E. Blom, in Reed’s 
Elgar, 1959, p. 179). This was not merely a vague expression of a ‘curious 
mixture of humility and pride’ (Blom); it would be perfectly natural for 
Elgar to speak this way, if he unconsciously found the major patterns of his 
melodic inspiration in the intonation of his native British English. Not only 
was it literally ‘in the air all around him’, whenever anyone spoke, but, 
even more important, since every human being speaks all the time to
himself when he is ‘thinking silently’, Elgar had within himself and his
own thoughts an inexhaustible source of his characteristic melody. This also
explains Blom’s remark (in Reed's Elgar, 1959, p. 178) that ‘it is as though
a composition had been for him (Elgar) like a slice of music cut from an
invisible store’, just as, when we speak aloud, we are simply externalizing
part of a continuing stream of internal, ‘silent’ speech.

The above are, of course, merely preliminary observations, intended to
call attention to a correlation which deserves closer attention and more
detailed analysis. There is a whole field for musicologists, as yet virtually
untouched, in the comparison of melodic structure and linguistic intonation
patterns. It would be worth our time and effort to examine the relation of
(say) Fauré, Debussy and Ravel to French intonation, that of Bruckner
and Mahler to Austrian intonation, and so forth. In this way, it might be
possible to clear up some of the hitherto unsolved problems of popularity
and reputation across national boundaries, as we have attempted to do
here in the case of Elgar.


References 

JONES, D. (1964), Outline of English Phonetics, Heffer.

MASON, D. G. (1918), Contemporary Composers, Macmillan.

PALMER, H. E. (1922), English Intonation, Heffer.

PORTE, J. F. (1933), Elgar and his Music, Pitman.

REED, W. H. (1959), Elgar, Farrar, Straus & Giroux.


16 Ivan Fónagy and Klara Magdics 


Emotional Patterns in Intonation and Music¹


Ivan Fónagy and Klara Magdics, ‘Emotional patterns in intonation and music’,
Zeitschrift für Phonetik, Sprachwissenschaft und Kommunikationsforschung,
vol. 16, 1963, pp. 293-313.


We want to describe the melodic patterns of ten different emotions or
emotional attitudes chosen more or less arbitrarily. Our paper is based on 
records of conversations, dramas, radio plays, as well as on experiments 
made with actors on the one hand and on vocal and instrumental musical 
compositions on the other. 

The ten feelings or attitudes are as follows: (1) joy, (2) tenderness, (3) 
longing, (4) coquetry, (5) surprise, (6) fear, (7) complaint, (8) scorn, (9) 
anger, (10) sarcasm. 


Hungarian emotive intonation patterns 


In Hungarian the pitch range is increased with joy. The level of the
intonation pattern is raised approximately by a third. At the beginning of each
phrase (the stretch of speech ranging from stress to stress) the voice rises to
a higher level than in neutral speech; this rise is followed by a sudden fall
of a fourth or fifth, or possibly a third, and the succeeding level is either
preserved until the end of the phrase or turned into a very slightly
descending line. In the case of animated joy the ending line of the phrases may
rise slightly (especially in women's speech). The voice never touches the
basic tone. In the case of mild joy the stressed syllables form a slightly
descending line in comparison with each other; excited joy produces
a capriciously alternating level of stressed syllables. Joyful excitement
often turns originally secondary stressed syllables into main stressed ones.
The stresses of the phrases are approximately of equal force (independent
of the importance of the contents expressed by words). The distribution of
stressed syllables is arhythmical. The stress distribution as well as the
capriciously leaping intonation practically results in the breaking of the
sentence into pieces. The tempo is lively; the stressed syllables are generally
sharp and clear; audible glides, ‘portamento’-s, occur in stressed
one-syllabic phrases (Figure 1).

1. We are grateful to Janos Ferencsik, chief musical director of the Hungarian State
Opera House, Bence Szabolcsi, academician, and József Ujfalussy, doctor of musicology,
who helped us greatly in the selection of the musical material. Many thanks for the
critical remarks of Janos Maróthy, doctor of musicology. We hope our data, our
correct or incorrect statements will give an occasion for Professor Otto von Essen, the
excellent expert of speech melody, to enrich the phonetic literature with further
valuable studies.


cca 86-90


Figure 1


Tenderness is also expressed on a higher pitch level. The level does not
fluctuate in this case. The stressed syllable keeps the phrase in a ‘legato
arc’, enclosing it so to speak; the melody of the phrase is very slightly
descending and ends far above the basic level. Sentences consisting of more
phrases show a gentle undulation of the pitch level (Figure 2). The tempo is
restrained, the loudness reduced. The articulation is extremely soft, often
labialized and a little nasal. The voice sounds ‘full’.


Figure 2


[…] The melody and stress curves run parallel until the stress minimum
following the peak; from here on loudness is more and more diminished while
the melody rises (Figure 3). The voice production is breathy, sometimes
turning into whisper. The innervation of the muscles taking part in the
expiration is slightly increased in the course of the sentence.

2. The influence of emotions on voice production had been previously investigated
by means of tomographic records, cf. Fónagy (1962).


... is láthatnám ...


Figure 3


The melody expressing a coquettish invitation moves on the mid-level
(or even lower). After an even, mostly ‘melodic’³ central part, the last
syllable glides up about a third in an audible ‘portamento’ without any
increase in loudness. The first syllable generally has a stronger stress
accompanied by an up-glide. Despite the stress, the emphasized syllable is
generally whispered. In the last syllable the voice often changes from a mid
register into a head tone. The tempo is lively, the phrasing is staccato
(Figure 4).


—rek ma es — te. 


Figure 4 


In surprise the voice suddenly glides up (or up-and-down) to a high
level within the stressed syllable, then – according to the kind of surprise –
falls to the mid-level (joyful surprise) or to a lower level (stupefaction)
leaving the sentence melody unclosed. The beginning of the phrase bears a
strong stress, the following syllables run down weakly. The tempo is
restrained. The voice is breathy (Figure 5).

3. An attempt to define ‘melodicity’ of speech and to distinguish the different grades
had been made in a previous study (Fónagy and Magdics, 1964).


Te, hogy kerülsz ide?! Azt mondtad, elutazol.


Figure 5


Among the different manifestations of fear, sudden fright and anguish
will be mentioned.

The typical intonation form of fright is similar to that of surprise. The
stressed syllable is likewise followed by a sudden fall, but the pitch range is
essentially narrower in this case. The intonation form is on a lower level (it
remains in the mid-zone), the unstressed syllables are arranged in a straight
line, sometimes with a slight melodic rise (about a semitone) in the second
part of the phrase. The tempo is lively, the loudness reduced. The voice is
very breathy, often hoarse, the articulation is tense. Horror is generally
pronounced in chest tone (Figure 6).


... mi házunk előtt álltak meg.


Figure 6


A prolonged state of fear, i.e. anguish, is first of all characterized by an
extremely narrow pitch range. The melody of the stressed syllables rises
about a semitone and returns to the mid-high level where it becomes, so to
speak, paralysed (Figure 7).


Lépéseket hallok a kert felől.


Figure 7


The most characteristic manifestation of complaint is a more or less
‘musical’ intonation floating on one level and ascending a semitone at
regular intervals. In the case of monosyllabic words, stress is accompanied
by an off-glide (‘portamento’). The tempo is restrained. The stress
distribution is as equal as the melody itself. More exactly, the melody
ascends at rhythmic intervals, following the periodic contractions of the
expiratory muscles. The voice production is normal, though the vocal cords
are more compressed than necessary (Figure 8).


Figure 8
Scorn is reflected by a more or less even and finally slightly descending
melodic line intoned on a very low level (Figure 9). The stressed syllables
are often lengthened; in that case the glide becomes audible. The loudness
is reduced despite the high tension of the expiratory muscles. The vocal
cords are compressed. The pharyngeal cavity is greatly narrowed. The
articulation is tense but unrounded. The tempo is slow. Scorn is generally
sounded in chest tone.


♩ = 80


Figure 9

Anger is generally expressed on a mid pitch-level and is characterized by
a straight, rigid melodic line leaping up a fourth, a fifth or a sixth interval at
the beginning of the phrases. The stressed syllables ascend frequently and
rhythmically. Some syllables – which bear at best a secondary stress in
neutral speech – appear with main stress in a hot-tempered dialogue
(Figure 10). The voice production is often imperfect, breathy. Loudness is
not in proportion with the activity of the maximally tightened expiratory
muscles. Articulation is also very tense. There is an equal tension in the
laryngeal area.


♩ = 120


Figure 10
Sarcasm is concentrated in the ‘portamento’ of the stressed syllables
gliding to a low level in a ‘wide arc’. The articulation is tense but illabial,
the voice is compressed or grumbling, purring (Knarrton). The stressed
syllables are lengthened (Figure 11).


Figure 11 


German, English and French emotive intonation patterns 
The intonation forms reflecting emotions are conventional, bound to
language and age. They can hardly be transplanted from one language into
another, any more than melodic patterns having a grammatical function.
Nevertheless this does not mean that the affective intonation forms are
arbitrary. The arbitrary signs are necessarily conventional, but the
conventional signs are not necessarily arbitrary (Fónagy, 1956). If a certain
emotion is expressed by similar melodic patterns in non-related languages,
then intonation must not be considered as arbitrary.

In order to control this suggestion we recorded (after having defined the 
situations) certain sentences spoken by two speakers of German, two of 
English and two of French, which sentences exactly corresponded to the 
above Hungarian sentences from the viewpoint of the emotion or attitude 
reflected. 

Joy increases the pitch range in each of the three Indo-European
languages; it is reflected in a higher pitch level, in a melody ascending
frequently and at irregular intervals as well as in an irregular stress
distribution. The stresses are approximately of equal force and are independent
of the semantic importance of the words. The tempo is lively. Articulation
is sometimes breathy (according to Trojan, 1952, p. 192, joy, exultation,
is not characterized by a breathy articulation).

In French – in accordance with the oxyton tendency of stress
distribution – the voice rises at the end of the phrases. So the joyful melody has
in French sentences a crescendo character as against the decrescendo
character of the Hungarian sentences. But if we do not directly compare the
Hungarian and French affective speech, if we relate first the affective form
to the neutral form, enumerating the characteristic differences, the
correspondences become obvious (Figures 12, 13, 14). In French the great
lengthening of the stressed syllables plays an important part in the
expression of joy. In Hungarian this possibility is limited by the phonemic
character of duration.


cca 84-86


Figure 12

Oh, I am so glad to see you, you've no idea, really I have waited for you for hours!


Figure 13


The differentiative features of the Hungarian intonation of tenderness
appear in the Western languages as well. The pitch level is higher than in
neutral statements. The pitch range is narrow. The stressed syllable
‘embraces’ the phrase even in the French sentence (‘repose-toi un peu’). In
the long stressed syllables the off-glide (‘portamento’) is audible (especially
in the English and German sentences).


Comme je suis heureuse de te voir! Je ne pensais pas te rencontrer!


Figure 14


The tempo is restrained, loudness sustained, the articulation is soft,
slightly nasal, labial, the voice sounds ‘full’ (Figures 15, 16 and 17). All
these correspond to Trojan's statements (1952, p. 178). A turn into head
tone was not found in our Hungarian and foreign experiments.


... müde du bist. Komm, ruh dich aus.


Figure 15


... tired, my dear. Have some rest!


Figure 16


... bien fatiguée. Repose-toi ...


Figure 17


The melody configuration (cf. Bolinger, 1949) reflecting longing, namely,
the slightly rising, descending, then at the end of the sentence gently
ascending melody likewise appears in German, English and French. The
divergence of melody and stress distribution is as characteristic in these
languages as in Hungarian. The pitch range is narrowed, the tempo is
restrained, the voice is breathy (Figures 18, 19 and 20). The increasing
muscular tension – mentioned by Trojan (1952, p. 181, ‘<’) – is obvious.


Wenn ich jetzt nur bei ihr sein könnte ...

♩ = 82-84


Figure 18


Figure 19


Figure 20


The stressed, nevertheless soft tertial up-glide of the last syllable
characteristic of coquetry was never absent from the sentences. In the
German sentence there are two up-glides (Figure 21). At the beginning of
the French sentence the melody makes a fourth interval step upwards,
without ‘< >’ (Figure 22). In the English sentence the melody rises […]
with ‘portamento’ […] (Figure 23). The […] of the
sentences are similar. The articulation of the stressed syllables is relatively
tense, but loudness at the same time is suppressed, with voice turning into
whisper. The change into head tone was clearly felt at the end of the
English sentence and in the first gliding syllable (‘sich’) of the German
sentence. Trojan (1952, p. 182) does not mention a change into head tone
in the case of ‘luring’ (Lockung).


Figure 21


Je suis libre ce soir ...


Figure 22


... am not really doing anything tonight ...


Figure 23

Surprise increases the pitch range in the three Western languages too.
In German and English the voice falls a fifth or a sixth interval from a high
level (Figures 24 and 25). This sudden fall is to be found in French, but the
reversed movement, the sudden rise, is similarly frequent. In yes-or-no
questions surprise is reflected in an increase of the rising interval (Figure
26). According to Trojan, surprise (Verwunderung) is characterized by an
increase and decrease of tension and dynamics; he sees a relationship
between the intonation of the yes-or-no question and that of surprise (p.
182 and ff.).


Figure 24


Figure 25


Toi ...

♩ = 86-90


Figure 26

The intonation of sudden fright differs also in the Western languages
from the melody of surprise in having a narrower pitch range, in the
checking of loudness and speed and in its peculiar timbre. In the case of
surprise, astonishment, the melody of the phrase floats on a high level and
in the last syllable it falls to a lower one. In speech expressing fright the
melody does not stay long on the high level; the greater part of the
phrase constitutes a straight melodic line on the low level (Figures 27,
28 and 29).


♩ = 98-104

... Haus steh'n geblieben.


Figure 27


Figure 28


Ils s'arrêtent juste devant notre porte.


Figure 29


Anguish is characterized by an extremely narrow pitch range in the 
Western languages too; the stressed syllables ascend only one tone or a 
semitone from the rigid melodic line sounded on the mid-level (Figures 30, 
31 and 32). 


Was ist denn das? Wer kommt denn da?


Figure 30


I think there is somebody moving around there.


Figure 31


... du jardin.


Figure 32


The intonation of complaint is built on the same principles as in
Hungarian. The ‘musical’, ‘smooth’ melody is rhythmically interrupted
by semitonic rises (Figures 33, 34 and 35). The laryngeal and pharyngeal
muscles are tense.


... oft hab' ich ihn schon gebeten, mir zu helfen, doch niemals fand er Zeit dazu.


Figure 33


... there is no technician, the machines break down, you must spend half an hour waiting around.


Figure 34


Que de fois ai-je demandé ce service, mais elle a toujours refusé.


Figure 35


Scorn is characterized by a narrow pitch range and a compressed or
grumbled voice production, just like in Hungarian (Figures 36, 37 and 38).
Trojan considers chest tone to be one of the most significant marks of
scorn (1952, p. 187). We did not come across breathy articulation (Trojan,
l.c.) either in Hungarian or in the course of our foreign experiments.


Figure 36


♩ = 78

... würd' ich mich nicht einmal zeigen.


Figure 37


♩ = 84

... sortirai jamais en sa compagnie.


Figure 38


♩ = 128


Figure 39


Anger in German and English appears – as in Hungarian – in fourth,
fifth or sixth ascending intervals of the stressed syllables, frequently
interrupting the straight melodic line. In English there is an audible but
[…] stressed syllables (Figures 39 and 40). In French the voice
[…] an octave at the end of the phrase (Figure 41). Trojan stresses the
extremely strong dynamics of anger, the breathy voice production, the
‘heavy’ chest tone (which may sometimes change into head tone as a result
of great tension, p. 188).


♩ = 130

I tell you you're wrong! I tell you you are wrong! ... like that!


Figure 40


cca 140


Figure 41

Sarcasm is felt in the checked, ‘widely-arched’, stressed off-glide, in
the creaky voice as well as in the nasal timbre. The tempo is restrained. In
English and French the ‘portamento’ is generally characterized by a wider
arc (with a fifth or sixth interval) than in Hungarian. The rising up-beat
characteristic of sarcasm in German (‘So!’ Figure 42) appears to some
extent in Hungarian (‘Úgy!’ Figure 43) (Figures 44, 45, 46).


Figure 42 (pitch curve of ‘So!’; vertical axis 150-250, horizontal axis 0-40)


Figure 43 (pitch curve; vertical axis in c/s)


... feiner […], kann man schon sagen.


Figure 44


... pleasant evening, I must say.


Figure 45


Figure 46


Emotional patterns in music


It seems that similar emotions, attitudes, are bound to analogous melodies
in languages not interrelated. The musical signals of emotions may be
considered as panchronic tendencies standing above languages and ages,
which are realized according to the prevailing structure of the different
languages.⁴ These tendencies surpass not only the range of the different
languages but also the limits of verbal messages. They prevail even in
non-verbal communication. Emotions are expressed in European vocal and
instrumental music by a melody configuration, dynamics and rhythm
similar to those of speech.⁵

Joy is associated with a lively tempo, with short motives. The backbone
of the motive is the melody suddenly rising from a lower level (cf.
Mattheson (1954) ‘Ausbreitung unsrer Lebensgeister’, p. 16). This rise, or
surging upward, can be found even within the greater units as e.g. in
J. S. Bach's Whit-cantata (‘Mein gläubiges Herz, frohlocke!’), in the
bass aria of the third cantata (‘Empfind' ich Höllenangst und Pein, so
muß im Herzen ein Freudenhimmel sein’), in Handel's oratorio Joshua
(‘O had I Jubal's lyre!’), in the third movement (Das Wiedersehen) of
Beethoven's sonata Opus 81a, Les Adieux, as the main theme, as well
as in Schumann's wedding-song (Helft mir ihr Schwestern, Frauenliebe
und Leben Opus 42). In the third act of Wagner's Tristan the slow series
of ever-descending motives (Figure 47a) turns into the leaping melody
of the shepherd's pipe announcing the good news (Figure 47b). Kurwenal's
joyous exclamation seeing Tristan awaking in the third act (Endlich!
Endlich! Leben, o Leben!) is less stylized and stands nearer to the
intonation of joy.

4. B. Chaitanya Deva, examining some emotional intonation patterns in Dravidian
(Telugu), has come to the conclusion that only the pitch level changes significantly with
emotion. No ‘inflectional’ (configurational) characteristics could be revealed by
second degree equations considered by the author as the best means of expressing
melody plots.

5. D. Cooke, on the basis of abundant musical material, has come to the conclusion
that the different emotions are characterized by peculiar melody patterns. His book
(1959), unfortunately, was not available to us at the time of writing this paper.


Figures 47a, b


Tenderness brings about gently undulating melodies in a relatively
narrow pitch range as e.g. in Orfeo's aria, from the first act of Monteverdi's
Orfeo (‘E più felice l'ora che per te sospirai’), in Orfeo's aria from the
third act (‘Vi ricorda, o boschi ombrosi ...’); we recall the duettino
number 7 of Don Giovanni and Zerline (Don Giovanni by Mozart, first act,
Figure 48), or Zerline's aria number 13 (‘wie ein stummes Lämmchen
leiden ...’, cf. Jouve, 1942, p. 111), Zerline's aria number 19, Don
Giovanni's serenade, Wotan's farewell from Brünnhilde (Die Walküre by
Wagner, third act, ‘Der Augen leuchtendes Paar ...’), Pelléas's words
spoken to Mélisande in the fourth act of Pelléas et Mélisande by Debussy
(‘Oh! qu'as-tu dit, Mélisande ...’). Wozzeck entreats Marie to stay on with
such a motif in the third act of Alban Berg's opera (‘Du sollst da bleiben,
Marie ...’). Embracing, gentle, legato melody curves express the tender
feeling in Euridice's aria in the first act of Monteverdi's Orfeo (‘Io non dirò
qual sia nel tuo gioir, Orfeo ...’), in […] (Don
Giovanni, first act: ‘das kannst du nicht ...’), in Pamina and Tamino's
duet (Die Zauberflöte by Mozart, second act, Figure 49), in Brangäne's
affectionate words to her […] (‘Herrin, Isolde, […]’), and in […]
words addressing Mélisande in the first […] of Pelléas et Mélisande by
Debussy (‘Donne-moi tes deux petites mains’).


Don Juan: Gib mir die Hand, mein Leben, komm in mein Schloss mit mir!


Figure 48


Pamina: Tamino mein! O welch' ein Glück!
Tamino: Pamina mein! O welch' ein Glück!


Figure 49


These melody forms go far beyond the demonstration of tenderness in 
certain periods of European music. The so-called ‘dolce’ style is practically 
a general trend in rococo music. 

The characteristic musical form of longing is a slightly descending, and at
the end of the motive, a gently rising melody. The tempo is strongly
restrained. Dynamics are first similarly restrained, then slowly increase, and
at last – in contradiction with the melodic line – decrease again (cf. pp.
287-8, 294). This melody occurs in Orfeo's aria from the third act of
Monteverdi's Orfeo (‘Al mio languire ...’), in Tamino's picture-aria (Die
Zauberflöte, first act, ‘Ich fühl' es ...’ cf. Ujfalussy, 1961), in the second
movement (‘Die Abwesenheit’) of Beethoven's sonata Opus 81a in E flat
major, Les Adieux, as a main theme, in Beethoven's song An die ferne
Geliebte, in Schubert's song Gretchen am Spinnrade (‘Sein Händedruck
und ach, sein Kuß ...’), in Schumann's Träumerei, in Brahms's song O
wüßt' ich doch den Weg zurück, in the kiss motif of Verdi's Othello,
in Wagner's Tristan-prelude (Figure 50). Mélisande sighs deeply as she
sees her ring falling into the fountain in the second act of Pelléas et
Mélisande by Debussy (‘Elle est si loin de nous ...’).


Langsam und schmachtend


Figure 50


Coquettish invitation, flirtation, luring, has a slightly descending, then
with a last short (generally staccato) tone, ascending, musical form. Such
motives accompany the offering and luring gestures of the girl in Bartók's
A csodálatos mandarin (The Miraculous Mandarin), as e.g. when she lures
the old cavalier, or later in her flirtation – first discreetly and then
vehemently – pursued by the mandarin (Figures 51a, b). This coquettish
movement is ‘softer’, more playful and gentle in Susanne and the count's
duet in the second act of Mozart's Figaro (‘die sich gar schnell vergibt ...’).


Figures 51a, b

A melody suddenly falling from a high level to a low one may express
surprise in music too. Zerline expresses her astonishment in this way when
she sees the abandoned Elvira asking for Don Giovanni's life (‘Sie wünscht
sein Leben?’); the same motif expresses her amazement at the news of
Don Giovanni's damnation. This pattern occurs in Ottavio and Masetto's
motif ‘Che impensata’ (cf. Ujfalussy). Golaud gapes in astonishment with a
similar motif at the sight of Mélisande's beauty (first act of Pelléas et
Mélisande by Debussy, ‘O vous êtes belle!’) and so with Bartók's Judith
seeing the jewels in Bluebeard's castle (Figure 52).


Figure 52


The motif of surprise, amazement, very much resembles that of horror.
After a sudden fall from a high level the melody grows rigid and moves
between narrow limits. Horror is reflected in the monologue of Boris
tortured by visions (Boris Godunov by Moussorgsky, third act, Figure 53),
as well as in Marie's alarmed cry (‘Hilfe!’, third act, second scene of
Wozzeck by Alban Berg), or later, second act fourth scene, in Wozzeck's
half-mad phantasies (‘Der Mond ist blutig ...’).


Ditja okrovavlennoe vstaet


Figure 53


Horror often turns into anguish; it becomes lasting as e.g. in Leporello's
part when the statue appears in the door. The range of pitch narrows from
a seventh to a second (Don Giovanni, second act, ‘Ach nimmer, oh möcht'
ich solche Gäste doch nimmer seh'n ...’). That is how Papageno sings
hearing about Sarastro's arrival (Die Zauberflöte, first act, Figure 54, cf.
Ujfalussy); such a motif accompanies Varlam and Misajl's stealing out of
the forest in Boris Godunov (third act); with a similar motif Golaud lurks
with his child in front of Mélisande's window (Pelléas et Mélisande, third
act, fourth scene, ‘Parle plus bas: Que font ils?’); Pelléas feels uneasy in his
secret meeting with Mélisande (in the fourth act, first scene twice, ‘J'entends
parler derrière cette porte’) or Mélisande hides in Pelléas's arms in the
fourth scene of the fourth act (‘Il y a quelqu'un derrière nous’).


O wär' ich eine Maus, wie wollt' ich mich verstecken!


Figure 54
Complaint brings about a monotonous melody regularly ascending a
semitone. The lamento choir of the Psalmus Hungaricus by Kodály is the
stylized form of the Hungarian intonation form of complaint. The ascending
emphatic tones embrace the lower unemphatic ones. The Lachrymosa
of Verdi's Requiem as well as the lamento choir from the second act of
Monteverdi's Orfeo (‘Amor, Amor, Amor’) reflect the paroxyton stress of
the Italian language. The ‘embracing’ stress falls on the second syllable.
The series of motives surges upwards (Figure 55). The monotonous melody
of complaint is present in the first shepherd's part (‘Oh, quanto è in vista
dolorosa ...’) in the second act of Monteverdi's Orfeo, in the first
tenor's voice of the choir (‘Quanto duol soffrir, ahimé!’), as well as in the
orchestra accompanying Orfeo's wailing (‘Tu se' morta ...’). This
lamenting melody appears in the orchestra when Donna Anna discovers
her father's corpse (Don Giovanni, first act, cf. Jouve, p. 64), or when the
statue appears in Don Giovanni's dining room (second act); Samuel
Goldenberg and Schmuyle complain like this in Moussorgsky's Pictures
at an Exhibition.


Come un lamento


Figure 55
Anger in music is similarly characterized by upward leaps of fourth and
fifth from a straight melodic line as e.g. in Ozmin's aria Number 3 (first
act of Die Entführung aus dem Serail by Mozart), in Ozmin and Blonde's
duet (second act), in Beckmesser's fuming part of the third act of Die
Meistersinger von Nürnberg by Wagner (Figure 56). In Elvira's part in the
first act of Don Giovanni (‘Weh' dir, Heuchler’) the melodic line ‘keeps
level’ under and above, interrupted by falls of fourth or fifth.


Aus seiner Schusterstuben hetzt endlich er den Buben mit Knüppeln auf mich her ...


Figure 56
Scorn is expressed by a descending melodic line in a narrow range. Such
a motif is sung by Elvira warning Zerline against […] in her aria
Number 8 of the first act (‘Verachte was er spricht’). Wotan addresses
Hunding in such a manner at the end of the second act of the Walküre
(Figure 57). Golaud speaks about Mélisande with a like melody (Pelléas,
fourth act, second scene, ‘Vos longs cheveux servent enfin à quelque
chose’).


Geh hin, Knecht! Kniee vor Fricka: meld' ihr, daß Wotans
Speer gerächt, was Spott ihr schuf. Geh! Geh!


Figure 57


In the sarcastic melody the voice glides from a high register to low. The
high […] emphatic and […] placed by the punctuated note and […].
With such a motif Don Giovanni ‘courts’ Elvira in order to send her off
with Leporello, to be able to make love to Elvira's housemaid (second
act). This melody alone could reveal Don Giovanni's intention, if Elvira
would not cling so firmly to her illusions. This motif is heard from Masetto
addressed to the almost perfidious Zerline (‘Wenn der gnäd'ge Herr wird
sagen ...’); with such a melody Siegfried speaks to Mime in the second
scene of the second act of Wagner's opera (Figure 58); Hans Sachs informs
Beckmesser with this motif that his trial song is not good enough to meet
all requirements (Die Meistersinger von Nürnberg by Wagner, second act,
‘er hält's auf die Länge nicht aus ...’), and inquires later after his health
(‘Herr Merker, sagt, wie steht's? Gut?’); the women mock the arrested
guardsman, or the youngsters the ‘mad Ivanovich’ with such a melody in
the third act of Boris Godunov; this is how Golaud speaks about Mélisande
(Pelléas, fourth act, second scene, ‘Ils donneraient à Dieu des leçons
d'innocence’).


Figure 58

Despite the much greater (so to speak liturgic) restriction of the melodic
formation of folksongs on the one hand, and of the relative independence
of their texts and melodies on the other, certain emotions are reflected in the
same way in our folksongs as in composed music and in speech melody.

Longing seems to be often accompanied by a descending and slightly
rising motive in folksongs too (‘En Istenem add megérnem ...’,
‘Visszanéztem félutambul ...’, ‘Álom, álom, mért nem jész ...’ [‘My God,
let me live to be with the one I love ...’, ‘I looked back half way on my
journey ...’, ‘Dream, dream, why do you delude me ...’] etc.). According
to the statistics made on the basis of Kodály's volume (1960) a longing
text was found with this motif in eleven songs while three songs expressing
the same feeling showed other melodic solutions. In two cases other kinds
of texts were accompanied by the ‘melody of longing’.

The lamenting, plaintive melody is characteristic of popular mourning
songs, and it appears in a great number of Hungarian folksongs (‘Sirass
édesanyám ...’, ‘Árva vagyok, árva ...’, ‘Istenem, Istenem ...’, ‘A
búbánat, keserűség ...’, ‘Szegény Szabó Erzsi ...’ [‘Mourn for me, my
mother ...’, ‘An orphan am I ...’, ‘My God, my God ...’, ‘The grief and
bitterness ...’, ‘Poor Erzsi Szabó ...’] etc.). In Kodály's volume the
melody of twenty-eight plaintive songs imitates the intonation of complaint,
in nine songs the plaintive text is combined with other kinds of melody,
and in two songs the lamenting melody accompanies texts of some other
kind.
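
The song counts reported for Kodály's volume can be restated as proportions. The following is only a minimal sketch in Python, not part of the original study: the dictionary layout and the function name `agreement_rate` are assumptions introduced for illustration. For each emotion it computes the share of songs with the relevant text whose melody also follows the corresponding intonation pattern.

```python
# Song counts as reported in the text for Kodály's volume (1960).
# The data layout and all names here are illustrative assumptions.
COUNTS = {
    "longing": {"text_and_melody": 11, "text_other_melody": 3, "melody_other_text": 2},
    "complaint": {"text_and_melody": 28, "text_other_melody": 9, "melody_other_text": 2},
}


def agreement_rate(counts):
    """Share of songs with the emotional text that also carry the matching melody."""
    with_text = counts["text_and_melody"] + counts["text_other_melody"]
    return counts["text_and_melody"] / with_text


for emotion, counts in COUNTS.items():
    print(f"{emotion}: {agreement_rate(counts):.0%}")
# prints "longing: 79%" and "complaint: 76%"
```

On these figures about four out of five longing texts (11 of 14) and three out of four plaintive texts (28 of 37) are set to the expected melody type.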

Tenderness is expressed by gently undulating melodies, slightly descending
melody curves (‘Repülj madár, repülj ...’, ‘Arokparti kökény ...’,
‘Fürjecském, fürjecském ...’, ‘Zöld erdőben ...’ [‘Fly, swallow, fly ...’,
‘Blackthorn by the gully ...’, ‘My little quail ...’, ‘In the green forest ...’]
etc.), while quarrelling, anger are reflected in a melody ascending always on
the same level (‘Asszony, asszony, ki a házból ...’, ‘Verjen meg az egek
ura ...’, ‘Verjen meg az Isten ...’ [‘Wife, wife, leave my house ...’, ‘May
the Lord of Heaven damn you ...’, ‘May God damn you ...’] etc.).
According to the statistics made on the basis of the Kodály volume the
distribution of intervals in songs with tender and angry texts is as follows
(cf. Table 1).


Table 1 Distribution of intervals in Hungarian folksongs
with tender and angry texts

Interval    Tender        Angry
            (per cent)    (per cent)

first       10             3
second      52             5
third       30            20
fourth       8            30
fifth                     30
sixth                      8
seventh                    2
octave                     2




References 


Bolinger, D. (1951), ‘Levels versus configurations’, Word, vol. 7, pp. 199-210.

Cooke, D. (1959), The Language of Music, Oxford University Press.

Deva, B. C. (1960), ‘Psychophysics of speech melody’, Zeitschrift für Phonetik,
vol. 13, pp. 8-27.

Fónagy, I. (1956), ‘Über die Eigenart des sprachlichen Zeichens’, Lingua,
vol. 6, pp. 67-88.

Fónagy, I. (1962), ‘Mimik auf glottaler Ebene’, Phonetica, vol. 8, pp. 209-19.

Fónagy, I., and Magdics, K. (1964), ‘Das Paradoxon der Sprechmelodie’,
Ural-altaische Jahrbücher, vol. 35, pp. 1-55.

Jouve, P. J. (1942), Le Don Juan de Mozart, Freiburg.

Kodály, Z. (1960), A Magyar népzene, Hungarian Folk Music, Budapest.

Mattheson, J. (1954), Der vollkommene Capellmeister, 1739, Reiman, Basel.

Trojan, F. (1952), Der Ausdruck der Sprechstimme, Vienna-Düsseldorf.

Ujfalussy, J. (1961), ‘Intonation, Charakterbildung und Typengestaltung in Mozarts
Werken’, Studia Musicologica, vol. 1, pp. 94-142.




Part Six 
Universality 


On the canvas that embraces all the languages of the world, is intonation
to be painted as a common theme or is it as arbitrary from language to
language as the connection between particular sounds and particular
word meanings? When we find two languages with numerous words
having similar meaning and similar sound, like French rapsodie,
magique, tocsin and English rhapsody, magic, and tocsin, our first
impulse is to assume a common origin somewhere, whether of the
languages as a whole or of some set of vocabulary that came to be
shared. The impulse seldom leads us astray; there is usually other
evidence to prove that the languages are related. If we have good reason
to believe that there is no connection in space or time between two
languages (short of the genesis of the human race itself), our impulse
then is to expect no similarity in the tie between sound and sense. It is
no surprise that the meaning ‘level’ is conveyed by level in English and
p'ing in Chinese.

But suppose that against this background of no connection between
sound and word meaning that shows any kinship with English, we find
that Chinese uses pitch in a dozen or more ways that duplicate its uses
in English. It cannot be due to coincidence; the repertory of pitch
signals is not large enough. Proportionately speaking it would be rather
like finding a thousand words in Chinese that match a thousand in
English, out of a total of a hundred thousand in each. With even 1 per
cent we would have to conclude that something about intonation must
transcend the usual language-specific connection between meaning and
form.

Part Six is only a sampling of a few languages where enough has
been published about their intonation systems to make comparison
possible. Nevertheless, the theme is unmistakable.

The first Reading, by Larsen and Pike, concerns Huastec. Neither
English nor Huastec is a tone language. Accordingly both are free to use
intonation for attitudinal meanings with a minimum of interference.
There is nothing to suggest that the two languages are even remotely
related in origin. The similarities that are found must then answer either
to chance or to some inborn capacity of human speakers that is there
to be unfolded in any language that encounters the right conditions. (A
third possibility, that they are borrowed from Spanish which in turn
resembles English, is hardly worth considering, given what is known
about the tenacity with which languages cling to their intonation
systems.) How closely Huastec matches English may be judged from
the contours that Larsen and Pike describe. Here are four things to look
for by way of hints:


1. The way the last syllable of Johnny is pronounced when calling from 
a long distance. 
2. The way Not so!, Never!, and the like are pronounced for great 
emphasis, with a slide down on each syllable. 
3. The way a question is asked when the speaker considers the whole 
idea absurd, e.g. Who eats horse? 
4. The way a pitch rises for great surprise. 

The system used for describing the intonation of Huastec is essentially
that of Pike (pp. 53-82). The authors are researchers with the Summer
Institute of Linguistics.

One of the areas of most intense linguistic activity, mainly on the
part of the Summer Institute of Linguistics, has recently been that
of New Guinea and surrounding territories. Alan Pence did fieldwork
on the Kunimaipa language for eight months between 1959 and 1962.
The second Reading gives his analysis of Kunimaipa, using Pike's
system, like that of Huastec in the preceding Reading. Though this
language is quite unrelated to either Huastec or English, the fundamental
resemblances are nevertheless obvious. Questions and statements are
typically distinguished by rising (or high) versus falling contours.
Perhaps the most striking resemblance to the intonation of Western
languages is the contour described as ‘mid high’ (p. 334), which is
approached from a high pitch, goes down, and then rises at the end.
The meanings ‘polite request, polite question’ are carried by it.
Compare the English:


                        you
Do you
            it?   Would            me?
       like                   help


Point-by-point comparisons between the intonation of one language
and that of another are few. The third Reading is Isamu Abe's
comparison of English and Japanese, valuable for its proof of
similarities far exceeding the reach of chance, recognizable in spite of
being hedged in and warped by all the other complicated movements,
generally peculiar to one corner of the world, that each language has
to make in its daily business of conveying a universe of meanings.

The fourth Reading, by Kerstin Hadding, compares some intonations
in Swedish with related ones in English. Despite the complication of
distinctive tone, the signalling of questions and statements in Swedish
turns out to be strikingly similar to what is found in English. The
method used by Hadding is as interesting as the results. She makes her
comparisons by testing listeners with artificially produced intonation
contours, which permit precise control of the parameters of pitch,
duration and intensity. Both her Swedish and her American listeners
reacted consistently when asked to judge the meaning of the contours
and when asked to decide whether a given pitch movement was up or
down.

The fact that the intonation of questions in Italian can be described
by using ‘tunes’ that were devised for English says [...]
the unity of intonation among Western languages. [...]
the direct inheritance of a common intonation [...]
Indo-European. Or it may be due to the direct inheritance of a great
deal else, which makes the daughter languages similar enough in other
respects so that they perturb the universal traits [...]
ways. More likely it is both. Whatever the [...]
the identities that one can discover in [...]
Chapallaz. A striking one is the [...]
an answer is courteously requested rather than [...] upon. Compare
the English and Italian:


[Pitch diagrams: English ‘When can you do it?’ and Italian ‘Quando potete farlo?’]


and contrast this pattern, where the main [...] occurs at the lowest
pitch followed by a rise, with the peremptory one where the
following pitch is lower and there is no rise:

When
     can you do it?

There are dissimilarities too, of course, especially in the timing of
[...], but the [...] outlines are pretty much the same.
The general characteristics of intonation [...] more
broadly than those of any of the other phenomena commonly gathered
under the label of ‘language’.


17 Raymond S. Larsen and Eunice Victoria Pike 


Huastec Intonation 


Abridged from Raymond S. Larsen and Eunice Victoria Pike, ‘Huasteco intonations
and phonemes’, Language, vol. 25, no. 3, July-September 1949, pp. 268-77.


Length

Vowel length is independent of stress and intonation. Long vowels
contrast with short vowels in environments where both the stress and
intonation are, within the limits of our perception, identical. In the two
following examples the stress is on the first syllable (a raised dot indicates
a long vowel): /bičow/ ‘town’, /bi·nom/ ‘giver’; in these two examples the
stress is on the second syllable: /cemθa·b/ ‘being killed’, /ce·mla·/ ‘death’.
The intonation of all four examples can be that of the narrative contour,
in which case the pitches of the first two and those of the second two are
alike.

Vowel length is not dependent on the position of the vowel in the word.
Long and short vowels occur in all possible combinations in dissyllabic
words (in the following formulae S indicates a short vowel, L a long
vowel): SS /?at’em/ ‘salt’, /calam/ ‘shade’; LS [...] ‘coward’, /?e·yal/
‘boss’; SL /ciyo·k’/ ‘chin’, /?amu·l/ ‘rubbish’; LL /?i·la·b/ ‘seed’, /ya·ni·l/
‘many times’. Likewise, all possible combinations occur in trisyllabic
words: SSS /hilk’omač/ ‘leftovers’; LSS /?a·šušlom/ ‘field of garlic’;
SLS /k’ahi·lom/ ‘widow’; LLS /huču·k’čik/ [...]; SSL /alabe·l/
‘pretty’; LSL [...] ‘one who gave’; SLL [...] ‘game,
plaything’; LLL [...] ‘(they) [...]’.

The following are words contrasting only [...]:
[...] /?ok’/ ‘head’; /cabal/ ‘cooked corn’, [...];
[...] ‘[...] sold (it)’, /?u-nu·hul/ ‘he is (or we are) selling’;
/?in-t’okat/ ‘I am clean’, /?in-t’oka·t/ ‘his cleanliness’.

Phonemically long vowels occur with one of two or more different
phonetic [...] phonetically [...]
syllables. The phonetically shorter variety occurs anywhere but phrase-
[...] a longer [...] variety than does the
[...] dot): /caku·l/ [tsa'ku:l] ‘angry’, /ya·ni·l/ [ya·'ni:l] ‘many times’.
[...] these [...] occur in a position other than phrase-final, they
[...] phonetically [...] variety:
[...] ‘[...] angry now’, /ya·ni·l k’ale/ [...] ‘he went
many times’. That is, long vowel phonemes are phonetically longer in
phrase-final position than elsewhere.


Potential contour point 

Intonation and stress are both described in terms of a point in the word 
which is designated as the potential contour point. This is located on the 
last long vowel of the word, or, if there are no long vowels in the word, 
on the first short vowel, regardless of the number of vowels in the word. 
In the following formulae of long- and short-vowel sequences, the syllable 
containing the potential contour point is in italics. Dissyllabic words: SS,
LS, SL, LL; trisyllabic words: SSS, LSS, SLS, LLS, SSL, LSL, LLL; 
quadrisyllabic words: SSSS, SLSS, LLSL, etc.; monosyllabic words: S, L. 


(Proclitics and parts of compounds, in our transcriptions joined by a
hyphen to the following word, are not reckoned in the location of a
potential contour point.) Because of this difference in the placement of 
the potential contour point, a word that contains only short vowels sounds 
very different from a word with one or more long vowels, even though the 
intonation contour may be phonemically the same. 
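
The placement rule stated above can be illustrated with a short sketch. It is not part of the original article: a word is encoded as a string of S and L, as in the formulae above, and the function name `potential_contour_point` is a hypothetical label chosen here. Proclitics and hyphenated compound members are assumed to have been removed beforehand.

```python
def potential_contour_point(pattern: str) -> int:
    """Return the 0-based index of the syllable carrying the potential
    contour point.

    `pattern` holds one letter per syllable: 'S' for a short vowel,
    'L' for a long vowel (an encoding assumed for this illustration).
    """
    if not pattern or set(pattern) - {"S", "L"}:
        raise ValueError("pattern must be a non-empty string of 'S' and 'L'")
    last_long = pattern.rfind("L")
    # Last long vowel if the word has one; otherwise the first short vowel.
    return last_long if last_long != -1 else 0


if __name__ == "__main__":
    for word in ["SS", "LS", "SL", "LL", "SSS", "SLS", "SSL", "SLSS", "LLSL"]:
        # Print 1-based syllable numbers, which read more naturally.
        print(word, "-> syllable", potential_contour_point(word) + 1)
```

On this encoding an all-short word always takes the point on its first syllable, while a single long vowel anywhere in the word attracts it, which matches the remark that the two kinds of word sound very different under the same contour.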


Intonation: phonemic system


In Huastec conversation, a sentence may recur with a variety of pitch
sequences. The difference in pitch from utterance to utterance is especially
noticeable at the end of phrases. The pitches on which the successive 
syllables of an utterance are pronounced form characteristic sequences of 
contours. These contrast with one another, and are thus phonemically 
diverse. The pitch levels which compose the contours are pitch phonemes; 
there are at least three of these, and apparently no more than three. We 
symbolize them by accent marks over the vowel letters and the length dot: 
an acute for high pitch, a macron for mid pitch, and a grave for low pitch. 

The contrasts between the levels cannot be analysed in terms of less
than three; but further phonetic levels of pitch appear to be analysable as
conditioned varieties of the three intonation phonemes. A mid pitch on a
syllable in a phrase-final word is higher than a mid pitch on a syllable in
other words, whereas a low pitch on a syllable in a phrase-final word is
lower than a low pitch on other syllables. That is, there is a greater
interval between a mid and low pitch in a phrase-final word than in other
words.

In relation to the sentence as a whole, the intervals between the three
phonemic levels depend upon the mood of the speaker. A tired or pouting
person may talk with a low voice and narrow intervals, whereas an
animated conversation may be carried on with wide intervals between the
levels.




Contour point 

Although each syllable is of necessity spoken on some pitch, the pertinent
pitch sequences which contrast with other sequences begin on a contour
point. That is, the contour point is the pertinent beginning point for a
significant intonation contour. For the most part these contours begin
at the last potential contour point in the phrase; such a point is here called
a routine contour point. Certain other contours begin on a syllable other
than the last potential contour point in the phrase; such a point is called a
special contour point. Most of the significant contours are composed of a
sequence of two phonemic levels; unless otherwise specified, one occurs
at the routine contour point, the other at the end of the phrase. The pitches
preceding the contour point, or between it and the phrase-final pitch, are
predictable and therefore non-distinctive.


The precontour 

This is the pitch sequence of the syllables preceding the contour point.
It is predictable and therefore need not be symbolized in a phonemic
transcription.

In fast speech, all syllables preceding the contour point (regardless of
word boundaries) have mid pitch: //?uteyic k’oyo·c tana·? ?a-lan-k’ima·//¹
[...] ‘He drew near and rested there in the house.’ In slower speech,
word boundaries (here symbolized by spaces) are important. The rule for
slow speech is that in every word, every syllable immediately preceding
the potential contour point of that word has low pitch. The sentence
already cited is [...]; compare also [...] ‘Then far away he
found a cleared spot’.


The intra-contour

This is the pitch sequence which occurs on the syllables between the
contour point and the end of the contour. It is predictable and need not
[...] transcription [...]. [...]
all intra-contours occur within one [...]
sequences in which an intra-contour [...].
[...] an intra-contour, a sequence must end in a short vowel and [...]
three syllables or more; for if the sequence ended in a long vowel, both the
beginning point [...] the contour would be contained in
[...] vowel; and if [...] consisted of two syllables of which the
second contained a short vowel, the beginning point of the contour would
be on one of the syllables and the end point on the other. Of trisyllabic
words, only words of the type SSS and LSS contain intra-contours,
because they are the only words with the potential contour point on the
first syllable. Of four-syllable words, only SSSS, LSSS, SLSS contain
intra-contours. Words of five or more syllables are similarly limited.

1. Double slant lines enclose a transcription including pitch symbols.
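
The reasoning for trisyllabic words can be checked mechanically. The sketch below is an illustration added here, not the authors' procedure: it enumerates the eight S/L patterns of three syllables and keeps those whose potential contour point falls on the first syllable, which is the condition the text gives for a trisyllabic word to contain an intra-contour. The S/L string encoding and the function names are assumptions of this example.

```python
from itertools import product


def potential_contour_point(pattern: str) -> int:
    """0-based index of the last long vowel, or of the first vowel if
    the word has no long vowel ('S' = short, 'L' = long)."""
    last_long = pattern.rfind("L")
    return last_long if last_long != -1 else 0


def trisyllabic_with_intra_contour() -> list:
    """Trisyllabic patterns whose contour point is on the first syllable,
    leaving the middle syllable free to carry an intra-contour."""
    patterns = ["".join(p) for p in product("SL", repeat=3)]
    return sorted(p for p in patterns if potential_contour_point(p) == 0)


if __name__ == "__main__":
    print(trisyllabic_with_intra_contour())
```

Only the three-syllable case is enumerated, because that is the case for which the text states its reasoning in full.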

The pitch of the intra-contour is the pitch of the lowest level of the
contour, unless that lowest level is mid, in which case the intra-contour
may occasionally and optionally vary to low. That is to say, if the contour
is high-low, mid-low, low-low, low-mid, or low-high, the intra-contour
is low. If the contour is high-mid, mid-mid, or mid-high the intra-contour
is mid, optionally varying to low. We have no example of a high-high
contour. Notice the intra-contours of these words: //?àhtitmà?//
['?áhtit‘mà?] ‘singer’, //?àhtitmá?// ['?áhtit‘má?] or ['?àhtitmá?] ‘a singer,
you say?’, //?ahtitma?// ['?ahtit‘ma?] ‘and a singer and...’, //?ahtitma?//
['?àhtitmá?] ‘not a singer?!’.
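
The rule for the pitch of the intra-contour amounts to a small lookup, sketched below as an illustration only. The level names, the function name `intra_contour_pitches` and the decision to raise an error for the unattested high-high contour are all assumptions made for this example; the article itself simply reports having no instance of high-high.

```python
LEVELS = {"low": 0, "mid": 1, "high": 2}


def intra_contour_pitches(start: str, end: str) -> list:
    """Permitted pitches for the syllables between the contour point and
    the end of a two-level contour, preferred pitch first."""
    for level in (start, end):
        if level not in LEVELS:
            raise ValueError(f"unknown pitch level: {level!r}")
    if start == "high" and end == "high":
        # No high-high contour is reported, so no prediction is made here.
        raise ValueError("high-high contour is unattested")
    lowest = min(start, end, key=LEVELS.get)
    if lowest == "low":
        return ["low"]
    # The lowest level is mid: mid, optionally varying to low.
    return ["mid", "low"]


if __name__ == "__main__":
    for contour in [("high", "low"), ("low", "mid"), ("mid", "high")]:
        print("-".join(contour), "->", intra_contour_pitches(*contour))
```

Returning a list rather than a single value keeps the optional variation to low visible for the three contours whose lowest level is mid.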


Intonation: morphological system 


Certain connotations which are not expressed by morphemes composed
just of segmental phonemes are added by means of ten or more different
intonation contours. Each contour is a sequence of two intonation
phonemes. Since these pitch sequences are not intimately related to
specific lexical morphemes or sequences of morphemes, and since their
meanings are various attitudes of the speaker superimposed upon the
more concrete (and more stable) meanings of the words, we have analysed
them as intonational features rather than as lexical tones. Each significant
intonation contour is a single intonation morpheme, since it is meaningful
as a whole and cannot be broken into smaller meaningful units.

Certain of the intonation contours will be first illustrated by a sequence
of examples in which the word /?iba·/ ‘no’ contains the same segmental
phonemes, but different intonation contours and different connotations.

//?iba·// (emphatic)
//?iba·// (matter of fact, without emotion)
//?iba·// (preoccupied, uninterested)
//?iba·// (called to a person a distance away)
//?iba·// (unfinished)
//?iba·// (questioning: ‘did you say no?’)
//?iba·// (deliberate or thoughtful, with surprise)
//?i·ba·// (finality: ‘absolutely not’)


This word ‘no’ shows how the several contours may be used with one
word. Regardless of the contour, /?iba·/ still retains the lexical meaning of
‘no’; but as the contours vary there are implications of different emotional
attitudes on the part of the speaker.


The narrative contour is mid-low, varying to low-low. Semantically it is
rather colorless, its chief characteristic being lack of emotion. It is used in
both statements and questions. It is located on the last word of the phrase,
beginning on the routine contour point and ending on the last vowel of
the phrase. If the routine contour point falls on a phrase-final long vowel,
the contour is a glide from mid to low. On a phrase-final monosyllabic
word with a short vowel, the contour is a simple mid pitch.² Examples
of the narrative contour on isolated words: S //hà?// ‘water’, L //?a·č//
‘grandmother’, SS //bēšè?// ‘badger’, LS //būčùl// ‘partridge’, SL
//colo·m// ‘lace’, LL //ya·ni·l// ‘many times’, SSS //wàk’a·té?// ‘jail’,
LSS //?éyalčik// ‘bosses’, SLS //?ic’a·mal// ‘deer’, LLS //?u·čāš-čìk//
‘(they) speak to each other’, SSL //tomkinē·l// ‘marriage’, SLL //?aki·-
là·b// ‘carrying-shawl’.

The low-low alternant of this contour optionally occurs on words
ending with a long vowel. This form of the morpheme is homophonous
with the basically low-low intonation morpheme (see below): //halū·b// ~
//halù·b// ‘namesake’.

The narrative contour occurs more frequently than any other [...].
In a certain text of 43 sentences by one informant, it is the only phrase-
final contour used.

The emphatic contour (high-low ~ high-mid) puts extra emphasis on the
word on which it falls. The high-low alternant of this morpheme occurs
on the [...] phrase, beginning on the routine contour point:
//hah yab in-le·? i-bú·rrò, ?in-le·? [...]// ‘He doesn't want a donkey,
he wants a horse’; //tiwa? ne?ec an-k’ahi·lom// ‘There goes the widow’.
The high-mid alternant occurs on a non-phrase-final word, beginning
on a special contour point and ending on the final syllable of the same
word. In this case another intonation contour, beginning on the routine
contour point, is present in the same phrase: //?in-cémθa? an-?inik//
‘He killed the man’.

The detached contour (low-low) signifies that the speaker is preoccupied
or uninterested or disdainful. It is used by the speaker when he is busy or
thinking of something else, and in scolding children: //ka-t’aha? ?ancana·?//
‘Do it this way!’.

2. We neglected to check the form of the other contours when they fall on a
phrase-final monosyllabic word.


The call contour (high-mid) is used (1) when shouting to or calling some-
one at a distance; (2) when the speaker is startled or frightened; (3) for
emotional emphasis. When the routine contour point is on the phrase-final
vowel (always a long one), there is a glide from high to mid. When the
routine contour point is on some vowel other than the last, the contour-
point vowel has high pitch and the vowels following it have mid: //hosé·//
‘Joseph!’, //benhamí·n// ‘Benjamin!’, //sára// ‘Sara!’, //katarí·na//
‘Katherine!’, //ka-met’a? an-?ic’á·mal// ‘Look at the deer!’.

Optionally this contour may be accompanied by a lengthening of the last
vowel of the word (if that vowel is lexically short), with a consequent shift
of the routine contour point to that vowel: //katari·na// ~ //katari·na·//
‘Katherine!’, //ta·ta// ~ //ta·tá·// ‘Father!’, //ka-met’a? am-bičim// ~
//ka-met’a? am-biči·m// ‘Look at the horse!’

One expression has been noted in which the entire contour falls on a
non-final short vowel; with the occurrence of this contour that vowel is
lengthened and the high-mid glide begins and ends on it: //ni-háyk’i?//
‘never!’, //ni-há·yk’i?// ‘absolutely never!’. In this case the post-contour
pitch is mid.


The sequence contour (low-mid) indicates that something is to follow. If
the routine contour point is on some vowel other than the phrase-final one,
the phrase-final vowel is mid and the contour-point vowel is low: //?àt’ēm//
‘salt’, //?à·šūš// ‘garlic’, //còcoblēk// ‘a kick’. If the routine contour point
is on the phrase-final vowel, the contour may be a glide from low to mid,
or optionally a low pitch on the last vowel but one and a mid pitch on the
last vowel. In the latter pronunciation the contour begins on a special
contour point: //ce·mla·// ~ //cè·mla·// ‘death’.

This contour most frequently occurs before short pauses, where it
connotes a sequence: //?in-le·? an-?ahan, ?an-bakan, ?ani han-cabal.//
‘He wants a roasting ear, a tortilla and some cooked corn’. When the
contour is used before a long pause it indicates that the speaker expects
to say more.


The hesitation contour (mid-mid) is similar in meaning to the sequence
contour but is less deliberate. Whether before a short or before a long
pause, its connotation is that the sentence is unfinished: //?ac’e·m
an-?a·šuš// ‘The garlic was wet ...’.


The question contour (mid-high) is frequently used by someone repeating
what another person has said. By means of this intonation he asks, ‘Is that
what you said?’ Examples: //?ic’i·lom// ‘playful’, //?ic’i·lóm// ‘Did you
say playful?’; //k’ale ya·ni·l// ‘He went many times’, //k’ale ya·ní·l// ‘He
went many times, did you say?’.

This is also the intonation used when assent or dissent is expected from
the one spoken to: //ne?ec ta-?a·im k’al an-to?ól// ‘Are you going
fishing?’, //k’a?i·l an-t’élé?// ‘Is the baby hungry?’

Where the routine contour point of this intonation morpheme falls on a
phrase-final long vowel, the contour is a rising glide from mid to high.
When the routine contour point falls on a vowel other than the last one,
no glide occurs, but the pitch steps up to high on the phrase-final short
vowel, not earlier: //čubaš in-t’aha·// ‘Does he surely do it?’, //kida·b
an-?iθibloméik// ‘Is the corn ugly?’.

The precontour preceding a question contour (p. 319) is more frequently
spoken rapidly, with mid pitch, than slowly, with mid and low pitches:
//ya·nic in-k’i?ya·mál// [...], or, in slower speech,
[...] ‘He hunted a lot, you say?’.


The unexpected contour (low-high) is used when the speaker is surprised
or startled, but is deliberating about what has happened or has been said.
If the last vowel of the phrase is long, the contour is a glide from low to
high; if the last vowel is short, the vowel of the contour point is low and
[...] last vowel is high: //hale? tin-?ulal in-le·? i-cànák’// ‘Why does he say
he wants beans?’, //?ancana·? in-t’aha·l a-halù·b// ‘Does your namesake
do like this?’


The superemphatic contour (mid-low mid-low) [...]. In
the example we have given, it changes /?iba·/ ‘no’ to //?i·ba·// ‘absolutely
not’; the short vowel in the first syllable changes [...]
contour seems to be more emphatic than the [...]
two syllables; when the routine contour [...] final vowel,
this contour must of necessity begin on a special contour [...].

The contempt contour (high slurred to low) has been noted in at least two
examples. It starts with high pitch [...] special contour point, located on the
[...] potential contour point of the [...], and steps down gradually to
low. Each syllable of the intra-contour (pp. 319-20) is slightly lower than the
preceding one. //hita? kin k’apuw am-bičim// ‘who eats horse?’, //hant’o
a-wašà·l// ‘What are you looking [...]?’ The connotation of the first is
contemptuous, disdainful; the [...] joking addressed to a year-old baby.

[...] other intonation morphemes [...] discovered. A high-high
[...] would appear to be possible within the [...]. Any
[...] would have to be special [...] like the one discussed in this section,
or a combination of types, like the one in the last paragraph, or would
force a different basic analysis.


Summary 
Huastec is considered to have phonemic vowel length because (1) certain 
minimally different words are persistently differentiated by length alone, 
(2) all the possible sequences of long and short vowels occur in words of 
two and three syllables, and (3) the differences in length persist in spite of 
intonation. 

The pitch differences heard in Huastec are considered to be intonational
(i.e. to constitute pitch morphemes) rather than ‘word tones’ because (1)
there is no lexical pitch contrast between words of the same consonant-
and-vowel pattern, and (2) the choice of a particular pitch sequence is
determined by the attitude of the speaker, not by any lexical consideration.

In place of a system of contrastive lexical tones combined with some
overlapping intonations, like that reported for the Maya of Yucatán, we
have found in Huastec an intonational structure with a restricted number
of morphemes, each composed of a sequence of pitch phonemes.
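
The two-level contours described in the article can be collected into a small table. The sketch below is an illustrative summary added here, not part of the original analysis. The dictionary layout and the names `CONTOURS`, `by_sequence` and `HOMOPHONOUS` are assumptions. The superemphatic and contempt contours are left out because the text treats them as special types rather than simple two-level sequences.

```python
from collections import defaultdict

# Contour name -> pitch sequences (alternants) given in the text.
CONTOURS = {
    "narrative": [("mid", "low"), ("low", "low")],
    "emphatic": [("high", "low"), ("high", "mid")],
    "detached": [("low", "low")],
    "call": [("high", "mid")],
    "sequence": [("low", "mid")],
    "hesitation": [("mid", "mid")],
    "question": [("mid", "high")],
    "unexpected": [("low", "high")],
}


def by_sequence(contours: dict) -> dict:
    """Invert the table: pitch sequence -> names of contours using it."""
    index = defaultdict(list)
    for name, alternants in contours.items():
        for sequence in alternants:
            index[sequence].append(name)
    return dict(index)


INDEX = by_sequence(CONTOURS)
# Pitch sequences shared by more than one contour.
HOMOPHONOUS = {seq: names for seq, names in INDEX.items() if len(names) > 1}

if __name__ == "__main__":
    for seq, names in HOMOPHONOUS.items():
        print("-".join(seq), "is shared by:", ", ".join(names))
```

The inverted table brings out the homophony the authors mention between the low-low alternant of the narrative contour and the detached contour. It also shows that the high-mid alternant of the emphatic contour has the same pitch sequence as the call contour, though the two differ in where they are placed in the phrase. No entry exists for high-high, in line with the absence of examples.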




18 Alan Pence


Intonation in Kunimaipa (New Guinea)


Alan Pence, ‘Intonation in Kunimaipa (New Guinea)’, Linguistic Circle of
Canberra Publications, Series A Occasional Papers no. 3, Australian National
University, 1964. Now Pacific Linguistics.


Introduction 
This paper concerns one aspect of the phonological system of the Kuni-
maipa language.¹ It is an analysis of a system of pitch signals which are
distributed over phrases, and which add shades of meaning to utterances.²
In analysing this intonation, two ideas current in the theoretical work of
Kenneth L. Pike have been of help. The first is the idea of hierarchy. Pike
regards phonology as made up of basic building blocks (units) of various
types. These form a series of levels which he organizes smallest to largest in
a V-shaped display. The smallest unit is the phoneme. Phoneme units are
distributed in such a manner as to produce syllables, and these in turn make
up phonological words, and so on. The intonation of Kunimaipa fits into
the total Kunimaipa phonological system at a mid level, which will be
called phonological phrase.
The second idea found helpful came out in 1945 in Pike's treatment of
American English intonation. This was the dichotomy he made between
precontour and primary contour. In the current literature, the terms margin
and nucleus are used (Pike, 1962). These terms indicate that we may expect
to find in phonological systems peaks of activity and troughs of activity.
We may find peaks with certain characteristics, and troughs with differing
characteristics. The terms prenuclear contour and nuclear contour are used
in this paper to designate trough versus peak activity, and the dichotomy
has proved very useful in simplifying the description.

1. The main body of the some 8000 speakers of Kunimaipa live in the Goilala
Sub-District of Papua; however, the dialect studied here is that spoken in the
Bubu River area near Garaina in the Morobe District of New Guinea. This
analysis is based on field work done in the area during 1961 and 1962 under
the [...] Summer Institute of Linguistics. [...] orthography which
represents [...] stop, affricate, and fricative [...].

2. Pike (1945) defines intonation in this way.

The total Kunimaipa phonological system is described in terms of a
hierarchy of levels. On each level, units which occur are described in
relation to the units with which they contrast, their internal modes of
variation, and their distinctive distribution. Each level is seen as having
units which are in turn distributed on higher levels. A full expansion of the
system is seen in the example (extracted from text).

[Kunimaipa example sentence in three phrases, with pitch lines]

‘Going to inspect (the traps), he found no (game); they were still set.’ The
whole is a phonological sentence (//). It is subdivided into three phono-
logical phrases (/), six phonological words (double space), and numerous
syllables and phonemes. Pitch is marked by solid and broken lines; high
pitch above the letters, mid pitch below the letters, and low pitch con-
siderably below the letters. Solid lines indicate crucial pitch points; hori-
zontal dotted lines indicate non-focal or fluctuating pitches.

In other examples a single syllable or segment may occur as the highest
level of the system. The phoneme /e/ occurs as a syllable, and when spoken
in isolation with intonation and other features, //e// ‘yes’, it is a
phonological sentence.

Intonation is an independent system closely related to the whole
hierarchy of phonological elements. It fits into the system at the level
which we call phonological phrase (P-phrase), making this a very diverse
part of the system.

Units of the Kunimaipa intonation system are primarily defined by
pitch. The minimum units of the system are three pitch levels, the intonemes
high, mid and low. These units combine into sequences which we refer to as
prenuclear contour and nuclear contour. There are four contrastive types of
prenuclear contour: stepping, rising, falling and level. [...]
of nuclear contour: high, mid, low, high-low, [...]
high-low, high-high-mid, mid-low and mid-low-mid. [...]
nuclear contour [...] optional prenuclear contours is termed an intonation
word (I-word). In the example [...] ‘at the wall’, the first four [...] with their
pitch pattern constitute the prenuclear contour. The [...]
pitch pattern constitutes [...]. The whole is an intonation
word. Pitch is indicated by the solid and broken lines, mid first syllable,
high syllables two through four, and low final syllable. The emic³ content
of this I-word is a stepping prenuclear contour followed by a low nuclear
contour.

3. By emic content is meant the content as the hearer would [...] it in [...] of
the system of the language. The /p/ in [...] differs in [...]
pronunciation from the /p/ [...].

While the total system is described in terms of three level intonemes, in 
reality these registers are based primarily on occurrences of the nuclear 
contours. Though prenuclear contours are describable in these terms, in 
some respects they appear to function more directly as total contours (note 
above the contrast in the labels given to prenuclear v. nuclear contours) 
which coincide at some points with the levels of the nuclear contours. 

Pike (1945, p. 70) notes a similar situation in English intonation in the 
‘descending stress series’. This is a unique contour in which there may be 
‘more stressed syllables or distinct pitches than can be fitted into four 
levels’. 

The main points of contrast between the prenuclear and nuclear contours 
are: (a) prenuclear contour glides occur only over syllable boundaries 
(except that if the prenuclear contour consists of a single syllable, it 
may be gliding); nuclear contour glides may occur on one syllable or over 
syllable boundaries; (b) prenuclear contours may occur on [...] 
syllables, nuclear contours occur on a [...]; (c) sequences of up to four 
prenuclear contours [...] by pause. Pause usually occurs between [...] 
nuclear contours and following prenuclear contours; (d) prenuclear contours 
occur at various pitch heights with no change of meaning; a change of level 
in the nuclear contour [...] a change of meaning. 

Though it is the nuclear contour which is obligatory to the I-word, 
the prenuclear contour appears [...] functional load in the system. For this 
reason the term prenuclear contour has been chosen, rather than one such as 
precontour. 

This paper will describe first the prenuclear contours, then the levels and 
nuclear contours [...] of the Kunimaipa [...] taped text of various sorts 
have [...] drawn and added to them [...] not in any way exhaustive. Rare 
nuclear contour types are only partially [...] distributional limitations 
of the [...]. In 
addition, since the analyst does not have a full command of Kunimaipa, it 
has been impossible to approach a complete description of the meanings of 
the various units. 

Stop!, but in spite of the phonetic difference the native speaker [...] 
unhesitatingly labels both /p/. Similarly if a [...] intoneme of low, [...] 
recognized as low even though it may be a bit higher sometimes, or a bit 
lower. It is emically low. 


Prenuclear contour 


In the data analysed, four types of prenuclear contour have been noted: 
stepping (mid-high), rising (low-high), falling (high-low), and level (mid- 
mid). As implied above, the internal relationship between the pitches of a 
prenuclear contour is more important than pitch height itself. 

The stepping prenuclear contour is basically a mid-pitch initial syllable 
followed by an optional high syllable, optionally followed by one or more 
syllables neutral in pitch. The contour has a meaning of normal or declara- 
tive statement. Figure 1 is a diagram of this contour. 


Figure 1 Stepping prenuclear contour 


The first two syllables of this contour may be pronounced with low to 
mid or even low to high pitch. The third syllable (neutral in pitch) may be the 
same pitch as the second, slightly higher (the vowel /a/ tends to draw this 
pitch up), or slightly lower; additional neutral-pitched syllables usually 
decay in pitch. The final neutral-pitched syllable may be drawn up by a 
following intoneme. Neutral syllables may be very short, or even voiceless 
following /s/. A one-syllable statement prenuclear contour may be either a 
level mid pitch, a low to mid rising glide, or a mid to high rising glide. These 
patterns are considered allocontours since they appear to vary freely; 
however, more investigation at this point is needed. In the following 
examples, vertical stroke (/) divides the prenuclear contour from the nuclear 
contour. In parentheses following the lexical meaning is an indication of the 
intonational meaning of the contour which is being illustrated. 
те, }1рато [mot ‘our things’ (.) 

da/ngasi [par ‘a weapon’ (.) 

maat ‘a red thing’ (.) 

Bez? [nap ‘man’ (.) 

ma/e “nati me [mo Н oh ‘They used to kill others.’ (.) 




The rising prenuclear contour is basically a pattern which begins with a 
low (or occasionally mid) syllable and rises regularly on each succeeding 
syllable to a final high syllable. It has a meaning of incompleteness or 
sequence, and contrasts with the stepping prenuclear contour in that (a) 
it often begins lower, (b) the initial upstep is smaller, and (c) each suc- 
ceeding syllable is higher in pitch than the previous. Figure 2 is a schematic 
representation of the rising prenuclear contour. 

Figure 2 Rising prenuclear contour 

Rising prenuclear contours occurring on one to three syllables may 
begin either low or mid, and those on two syllables often do not rise to high. 
Four or more syllable occurrences tend to rise the full low to high range, 
thus in longer ones the up steps between syllables are very short. This 
contour is often followed by a high nuclear contour, though most others 
may also occur. 


теїра/тс\, /mot ‘our things’ (...) 

Tangi,jah ‘He lit it.’ (...) 

Sapafne/puh ‘He will go and’ (...) 

Pop veir viiha/puh ‘This one they covered and left, and’ (...) 
с Ыры, E 
The falling prenuclear contour begins high and falls progressively 
throughout. Its meaning seems to be excitement. The contrast between this 
and preceding types is seen by comparing Figure 3 with Figures 1 and 2. 

Figure 3 Falling prenuclear contour 

[...] the pitch drops [...] with each syllable. However, [...] phonological [...] 
contour is lower in pitch than the preceding; but [...] rises slightly. 


SA, Eh 


окоћ пав“ ат verevat am hohoranev hao/han 


— EE 


‘Way down somewhere they came out and chanted’ (!) 


The level prenuclear contour is a sequence of mid- (or occasionally high-) 
pitched syllables. It has a meaning of suspense. It contrasts with the three 
other prenuclear contour types in that there is no significant rise or fall in 
pitch throughout. Figure 4 is a schematic representation of this type. 


Figure 4 Level prenuclear contour 


Careful listening to taped examples of this type reveals minute variation, 
up or down, from one syllable to another. This variation is without pattern, 
and does not affect the level character of this contour. The mid-level 
nuclear contour commonly occurs following this type; however, various 
others (low, high, high-low) have also been observed there. 


zeiparo/mot ‘our things’ (—) 


aban pongariv tin [am ‘two men, very carefully ...’ (—) 

menaui/a ‘kill chant’ (—) 

пі vii“/hoj ‘You go on and put it.’ (—) 


In text, sequences of prenuclear contours occur without being interrupted 
by pause. In a preliminary check of the sequences of two which might 
occur, only the following were not found: stepping-falling, rising-falling, 
level-falling, level-rising, rising-level, and level-level. The rare falling and 
level prenuclear contours are, of course, even more rare in sequence. In 
the following examples, plus (+) indicates a break between prenuclear 
contours. 
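
The preliminary check of two-contour sequences can be restated as a small enumeration. The following is only an illustrative sketch: the names `TYPES`, `NOT_FOUND` and `attested` are hypothetical, and the lists simply transcribe the six sequences reported above as not found.

```python
from itertools import product

# The four prenuclear contour types named in the text.
TYPES = ["stepping", "rising", "falling", "level"]

# Two-contour sequences reported as not found in the preliminary check.
NOT_FOUND = {
    ("stepping", "falling"),
    ("rising", "falling"),
    ("level", "falling"),
    ("level", "rising"),
    ("rising", "level"),
    ("level", "level"),
}

# Every ordered pair of types, minus the pairs that were not found.
attested = [pair for pair in product(TYPES, repeat=2) if pair not in NOT_FOUND]

for first, second in attested:
    print(f"{first}-{second}")
```

Of the sixteen logically possible ordered pairs, this leaves ten that the check implies were observed.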


so; hot *ka fhat 3xof pi [vo 
— 
“We kept going up inside the mountain, and’ (.), (.), (.) 
giftahar *a/kah hii ih ‘Later they put him way up there.’ (.), (.) 


safor ivo/sihoi *reipa/ro} Шо} 


—_-" 




€, a o younger sisters and brothers, we all...” (.), (.), (...) 


po. ui tkakam JC fot oh ‘Those ones were pained.’ (!), (—) 

One- or two-syllable prenuclear contours are often ambiguous as to 
whether they are one type or another. A one-syllable contour with pitch 
in the mid to low area might be interpreted as either stepping, rising or 
level type. A one-syllable contour with pitch in the mid to high area might 
be interpreted as either stepping, falling or level type. A two-syllable 
contour rising from low to mid might be interpreted as either stepping or 
rising type. Three factors are considered in interpreting such occurrences: 
(a) the height of the pitch, (b) the size of the rise between syllables and (c) 
context. The first two are applied according to the contrastive features 
already given of each prenuclear contour type. Cases which are still 
ambiguous are interpreted as a type which would be likely to occur in the 
intonation context. 
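
The ambiguity of short prenuclear contours, and the appeal to context as a last resort, can be sketched as a lookup followed by a tie-break. This is a rough illustration under stated assumptions, not the analyst's procedure: the pitch-area labels (`"mid-low"`, `"mid-high"`), the function names and the `likely_in_context` preference list are all hypothetical.

```python
# Candidate readings for short prenuclear contours, as listed in the text.
# The string labels for pitch areas are illustrative, not the author's notation.
AMBIGUOUS = {
    ("mid-low",): {"stepping", "rising", "level"},    # one syllable, mid to low area
    ("mid-high",): {"stepping", "falling", "level"},  # one syllable, mid to high area
    ("low", "mid"): {"stepping", "rising"},           # two syllables rising low to mid
}


def candidate_types(pitches):
    """Return the contour types a one- or two-syllable contour may represent."""
    return AMBIGUOUS.get(tuple(pitches), set())


def resolve(pitches, likely_in_context):
    """Pick the first context-preferred type among the candidates, or None."""
    candidates = candidate_types(pitches)
    for contour_type in likely_in_context:
        if contour_type in candidates:
            return contour_type
    return None


print(sorted(candidate_types(["low", "mid"])))
print(resolve(["mid-low"], ["falling", "rising"]))
```

Factors (a) and (b), pitch height and size of rise, would narrow the candidate set before context is consulted; they are left out here because the text gives no numerical thresholds for them.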


Nuclear contour 

There are ten types of nuclear contour: high, mid, low, high-low, high-mid, 
mid-low, mid-high, mid-high-low, high-high-mid and mid-low-mid. 

Among the variants of the nuclear contours are those conditioned by 
their occurrence in the P-sentence. The final syllable of the P-sentence has 
fast [...] drifts quickly into voicelessness. P-sentence medially 
[...] a nuclear contour tends to have a more 
controlled dynamic. In addition to this [...], [...] with final 
high and low intonemes at P-sentence boundaries tend to glide to extremes. 


Placement 


Each I-word has a nuclear contour. In most I-words this occurs on the final 
[...] occasional [...] the nuclear contour spread over 
two final contiguous vowels. The [...] of such occurrence needs 
further study. At a P-sentence boundary in distinct or emphatic speech, the 
nuclear contour sometimes occurs on a final unstressed CV syllable. In the 
example, го [pu ‘a boy?’, the high nuclear contour occurs on the final 
syllable whereas the prenuclear contour and P-word nuclear stress occur 
on the initial syllable. If the P-sentence ends with a voiceless variant of the 
[...]. 

Nuclear stress is not obligatory [...] contrast to [...]. 
However, only in rare cases is it omitted. [...] 
Variation in [...] intensity [...], so that occasionally a P-word or P-phrase 
nucleus may be very loud, and occasionally very soft. The following 
examples illustrate the placement of the nuclear contour. 


halito'/kor ‘doorway? 
Ља! кот Е у 


Rhasa//ha ‘He went?" 
СЕ ] 


hay [saha ‘He went?" (with voiceless final vowel) 


Description 

"The high nuclear contour has a meaning of impending, incompleteness 
or normal question. It occurs following the stepping, rising and level 
prenuclear contours. It may occur on a final non-stressed CV syllable 
following the rising prenuclear contour. In this occurrence the contour is 
extra-high; elsewhere at P-sentence boundaries it is high rising; at P-phrase 
boundaries it is a level high pitch. 
xe/iparojmot ‘our things’ (incomplete) 


yati ет уі; Dat ‘took, came, and left it, and? (incomplete) 
рша“ ha/puj ‘the flute's? (incomplete) 
— 


vat g me/ngi “You brought it? (incomplete) 
—“ 


ni vii//hoj ‘You go on and put it.’ (incomplete) 
The meaning of the mid nuclear contour is unknown.⁴ It occurs following 
the stepping, rising and level prenuclear contours, and has been 
observed to occur on a final non-stressed CV syllable. It is a mid-level 
pitch in all of its occurrences. 


те fiparo | /mot ‘our things’ (meaning unknown) 


menge; git [puh *We helped them and . , „" (meaning unknown) 


о" 


4. Because of the distinctive use of this contour and the place which it fills in the 
system, it is assumed that a meaning contrast exists. 




The low nuclear contour has a meaning of normal or unemotional 
statement. It occurs most often following stepping prenuclear contours; 
however, it has also been observed following rising, level and falling 
prenuclear contours, and on a final non-stressed syllable. It occurs as a 
low falling glide or extra-low pitch at P-sentence boundaries. At P-phrase 
boundaries, it is a low-level pitch. 


теу iparo | /mot ‘our things’ (normal) 
t 


afnga\/moh ‘I am telling you all.’ (normal) 
mefnapaj | /hat ‘thinking to set (traps) ...’ (normal) 


— 
— 


ета/ћа /puh ‘they came and... „* (normal) 


Pi [ma ‘his...’ (normal) ^ 
ier 


1 E 

angar ai/bu ‘people’ (normal) 4 
The high-low nuclear contour has the meaning of an announcement. It 
usually follows a stepping prenuclear contour, but has also been observed 
to occur following rising and level. At P-sentence boundaries it 
drifts quickly into voicelessness and [...] 
terminate at any particular point. At P-phrase boundaries the [...] point 
is more obvious because of contrasting nuclear dynamics. 


Прагој í ent t 
Ze/iparo]mo,t ‘our things’ (announcem! D ag 


t- H H 
rk ill others’ (announcement) ry 
лаје "mar Seet oh ‘They used to К d 
Ve M 
nea,’ fro! ‘Child’ (announcement) i 
L 
H (announcement) d 


hamab/afhoip “а big snake 
D nd 


H 


ита "(ап t) 

һа i nouncemen 

hat _etet_he/ jai ‘You all listen’ G | 
The high-mid nuclear contour has (tentatively) a meaning of polite 
statement. It has been observed only following the stepping prenuclear 
contour. 

Moneen S ite 
tefiparojmolt ‘our things" Gel ч 
(polite) ; j У. 


f H 
halomaj to} th Tm about to speak 


БЫ i D olite) 
-—— will [speak falsely: @ 


The mid-low nuclear contour has a meaning of emphatic statement. It 
has been observed only following the stepping prenuclear contour. At a 
P-phrase boundary it stops at upper low rather than gliding to extra-low 
as it does at a P-sentence boundary. 


refliparo E *our things? (emphatic) 
¢ufpumakih | /heh ‘He was in the men’s house.’ (emphatic) 
uo 


Joch ‘They did it.’ (emphatic) 


The mid-high nuclear contour has meanings of polite request, polite 
question, or nonemphatic call. It occurs only following the stepping 
prenuclear contour. When it occurs on a final non-stressed CV syllable, it has 
either of the latter two meanings. 


те Прато |/moit ‘our things" (polite question) 
ifti | Ља Гу ‘the firewood? (polite question) 
hafrangije/ngi! ‘You lit it. (polite question) 


dro mate ‘Companion’ (non-emphatic call) 


The mid-high-low nuclear contour has a meaning of deep feeling such 
as intense sympathy or desire. It has been observed following only the 
stepping prenuclear contour. It usually occurs spread over a sequence of 
two contiguous vowels, but has also been observed on a single syllable. 


Because it may occur on one syllable, it must be treated as a contour of 
three intonemes. 


zejiparo | ој *our things? (feeling) 

Kfivok velati пап} “Fill a bag and give it to me.” (feeling) 
рој "ка f kams Лоо "Those ones had pain.” (feeling) 

ро JV} “This” (feeling) 


The high-high-mid nuclear contour is used as an intense or distant 
call. It occurs on a final non-stressed CV syllable or spread over two V 
syllables, and is often spoken in a falsetto voice. It has been observed 
following the stepping and falling prenuclear contours. Because it always 
occurs lengthened, it has been interpreted as a sequence of three intonemes, 
in contrast to the high-mid nuclear contour. 


Нетер indt deiere ET Ehe е сунг ESSEN 
$1moopai ji tui tui ji eng lang -gio/gi ! 


call used when felling trees (distant call) 


пало Та Т1 'Varoa' (distant call) | 

The mid-low-mid nuclear contour is used as an excited sequence, both 
in listing items and as a type of hesitation. It has been observed following 
the stepping and level prenuclear contours. It occurs on the two final 
syllables of a P-phrase, either the stressed syllable and a CV syllable 
containing /a/, or two final contiguous V syllables, the second of which is 
/a/. When it occurs as a hesitation, it is often closed sharply by a glottal 
stop. The low to mid up-glide occurs on the final vowel, and often a mid 
to low glide occurs on the preceding syllable. 
хе tiparo Ymota_‘our things’ (ехойей sequence) 


ae Jos ‘blood’ (excited sequence) 
[d 
Spi/rara_ “sugar? (excited sequence) 
bed 


Табари | » (excited sequence) 
"1 Јаһари Lipua_‘the flute? (€: 


Implications 

Having completed a study of this type, it is [...] ask what significance 
it has in the overall linguistic picture. There [...] to be four areas 
of usefulness: 

1. No description of Kunimaipa phonology could be complete without [...] 
of pitch signals. 

2. [...] necessary to reproduce rhythmic 
and [...] acceptable [...] speakers [...] whom the language 
is their mother tongue. [...] Guinea [...] and thus will not be learned by 
large numbers of non-indigenes, any further analytical [...] which is done 
in it will be furthered by an understanding and use of this [...]. 



3. Various analysts are attempting to bring features of intonation in some 
way into their grammatical description. In a language like Kunimaipa 
where intonation obviously has a great deal of importance, a thorough 
analysis is absolutely necessary if intonation is going to be used with 
accuracy in the grammatical description. The importance of intonation, 
however, varies from language to language. 


4. Though this study does not blaze any new trails in phonological 
analysis, it does show that a systematic approach to intonation is possible. 
This may help others struggling with the same problems. 


References 


Pike, K. L. (1945), The Intonation of American English, University of Michigan 
Press. 

Pike, K. L. (1962), ‘Practical phonetics of rhythm waves’, Phonetica, vol. 8, 
pp. 9-30. 




19 Isamu Abe 


Intonational Patterns of English and Japanese 


Isamu Abe, ‘Intonational patterns of English and Japanese’, Word, vol. 11, 
December 1955, pp. 386-98. 


In the domain of pitch phenomena, Japanese has tone and intonation.¹ 
On the lexical level tone distinguishes the meaning of one word from that 
of an otherwise homophonous word. Intonation, on the other hand, is a 
phonetic manifestation (pitch being its instrument) of the attitude the 
speaker assumes toward the things spoken about or toward the auditor. 
It follows, therefore, that tone and intonation are functionally different 
from each other, in spite of the fact that they are characterized by the ebb 
and flow of pitch. 

The analysis of tone in Japanese has a considerable history. There 
appear to be two kinds of approach – (1) static (or positional) analysis and 
(2) kinetic (or motional) analysis. Phoneticians favoring the first method 
posit a certain number of syllabic pitch levels and assign one of them to 
each of the component syllables of a word. The number of pitch levels is 
variously assigned – some prefer three, some two, some four, and the 
question as to how many pitch levels are most suitable is still going on. 
However, there is a tendency among some Japanese students of phonetics 
to favor two levels (high and low). Now, the advocates of the second view 
(i.e. kinetic analysis) maintain that intra- or intersyllabic movement of 
pitch is more vital for linguistic analysis than the positional analysis just 
mentioned. In 1927, Kooichi Miyata remarked that it is sufficient 
for tonal [...] discover where in a given word there occurs a significant 
[...] one syllable and another – i.e. a fall. He 
regarded other pitch phenomena characterizing the word as of but minor 
consequence. Quite recently Shin Kawakami questioned the value of 
the prevalent theory of level analysis. He states that ‘it is impossible to 
think that there is such a thing as the height of a syllable. The important 
thing is [...] there is only the height of a certain point of a [...]’ 
[...] various aspects of intrasyllabic tonal shift in 
an accent phrase. It is interesting to note, in passing, that Miyata's analysis 
foreshadows the current phonemic approach now being experimented 
with in Japan. Shiroo Hattori (1954), for example, applies the term ‘accent 
nucleus’ to a syllable (he uses the term ‘mora’) which is or may be accompanied 
by another mora lower in pitch; e.g. in Karakasa, the second ka 
is high, and sa is low, so this ka constitutes the accent nucleus. He states, 
in effect, that one should consider the following two points to know the 
distinctive features of the accentual patterns of Japanese words: (1) Is 
there an accent nucleus? and (2) If it is present, on what mora will it be 
found? 

1. Japanese phoneticians usually [...] word tone as ‘accent’. Throughout this 
paper, the term ‘tone’ will be employed in compliance with the customary European 
terminology, and to distinguish [...] intonation. 
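
Hattori's two questions lend themselves to a short procedural sketch. What follows is a simplification under stated assumptions: the H/L labelling, the function name `accent_nucleus` and the pitch assumed for the first two morae of karakasa are illustrative only (the text specifies just that the second ka is high and sa is low), and the sketch ignores the case, covered by ‘is or may be’, where the lower mora would appear only on a following element.

```python
def accent_nucleus(pitches):
    """Return the index of the accent nucleus, or None if there is none.

    The nucleus is taken here to be a high mora immediately followed by a
    low mora within the word.
    """
    for i in range(len(pitches) - 1):
        if pitches[i] == "H" and pitches[i + 1] == "L":
            return i
    return None


morae = ["ka", "ra", "ka", "sa"]
pitches = ["L", "H", "H", "L"]  # pitch of the first two morae is assumed

index = accent_nucleus(pitches)
if index is None:
    print("no accent nucleus")          # question (1) answered in the negative
else:
    print(f"nucleus on mora {index + 1}: {morae[index]}")  # question (2)
```

A word labelled with no high-to-low step, such as a hypothetical `["L", "H", "H"]`, yields no nucleus under this sketch.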

These views have been quoted in order to show the general character of 
Japanese word tone. But the main concern of the present paper is intonation. 

Intonation is very different from tone as we have just described it. 
Intonation may be said to have an ‘emotive’ function. Its purpose is to 
supply a delicate shade of meaning to the utterance upon which it is superimposed. 
It belongs to the field of ‘expression’ – being the reflection of the 
speaker's feelings or attitudes. 

Tone and intonation often involve different aspects of pitch. Let us take 
up the word ame (rain). For comprehending the tonal structure of this 
word, all that is required is to notice that it has a perceptible fall in pitch 
between the element a and the element me. However, the absolute interval 
(i.e. musical range) of the fall is immaterial for tonal composition. Tonal 
value is a ‘directional’ value. And the so-called tonal pattern is an abstrac- 
tion from the actually collected facts of similar directional compositions. 
On the other hand, the actual modifications of tonal range or the ‘key’ of 
tone is intimately correlated with problems of intonation. For example, 
when we say that a widened pitch range is suggestive of the speaker's 
heightened feelings — animation, anger, etc., or that a narrowed range 
expresses irony, indifference, etc., we are speaking of intonation. All this 
has no direct relation whatsoever to the composition of tone. 
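
The claim that tonal value is ‘directional’ while range belongs to intonation can be illustrated numerically. This is a minimal sketch under assumptions: the semitone figures for ame (rain) are invented for the illustration, and `directions` and `scale_range` are hypothetical helper names.

```python
def directions(pitches):
    """Reduce a pitch sequence to its directional pattern, step by step."""
    pattern = []
    for previous, current in zip(pitches, pitches[1:]):
        if current < previous:
            pattern.append("fall")
        elif current > previous:
            pattern.append("rise")
        else:
            pattern.append("level")
    return pattern


def scale_range(pitches, factor):
    """Widen (factor > 1) or narrow (0 < factor < 1) the range around the mean."""
    mean = sum(pitches) / len(pitches)
    return [mean + factor * (p - mean) for p in pitches]


# Hypothetical semitone values for a-me (rain): a fall between the two elements.
ame_rain = [4.0, 0.0]

for label, factor in [("neutral", 1.0), ("animated", 2.0), ("indifferent", 0.5)]:
    contour = scale_range(ame_rain, factor)
    print(label, contour, directions(contour))
```

The interval changes with each scaling, which is the intonational side, while the directional pattern, a single fall, stays the same in all three cases.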

Japanese tone and intonation, both being pitch phenomena, overlap or 
clash in some instances. Nevertheless, it may safely be said that each has its 
own ground to maintain, as a general rule. Observe, for example, Kaku 
Jimbo’s remark that ‘sentence-intonation can never cause an inherent high 
pitch to become lower than a neighbouring lower pitch, or vice versa’ 
(1925). Superimposed upon the word ame (rain), for example, a rising 
intonation merely adds a rising tail to the final syllable of the word. 
Superimposed upon the word ame (candy) (slightly rising tone), a rising 
intonation follows and elongates the direction of the word tone, bringing 
the last syllable yet higher. In any case, the intonational addition in no 
wise affects the basic tonal structure of these words. In a certain Chinese 
dialect, similar phenomena appear to occur. Chao observes that in the 
sentence Zhege xao! (It is decidedly good!) a falling intonation is not 
added simultaneously to the last syllable, but is joined on successively, 
after the word tone is completed – thus xao! (УУ) (1933). (Be it noted that 
Japanese is not, strictly speaking, a tone language like Chinese, but 
similarities of this kind are still there.) 

It has been hypothetically stated that intonation is emotional in origin – 
as it apparently is even now. Some phonologists are not inclined to treat 
intonation as an arbitrary system of signs. The late Arisaka, for example, 
maintains that there exist quite natural relationships between tunes and 
feelings. According to him, intonation is purely physiological; and for this 
reason, even if it has a number of qualities in common exhibited by its users, 
it is not a social system – and therefore, it is not a phonological system 
which is part of this social system (1940, pp. 128-31). Bally, on the other 
hand, asserts that ‘on sait que, d'une manière générale, les intonations 
engendrées par l'émotion ne restent pas l'apanage du langage instinctif: 
elles pénètrent, sous une forme schématisée, dans la langue même; tous les 
idiomes possèdent un jeu varié de mélodies, fixées par l'usage et exprimant 
des sentiments déterminés’ (Bally, [...]). [...] 
satisfied with the usual dichotomous classification 
(tone and intonation), claim that there should be recognized (1) 
tone (word [...]), (2) intonation (natural and spontaneous speech 
melody), and (3) tune (conventionalized or [...] 
melody inherent in a given community) (e.g. Terakawa, 1945, pp. 305-7). 

[...] contains a few arbitrary uses, but they are 
[...]; even the arbitrary uses may 
[...] consistent with the nervous interpretation (for 
[...], i.e. tension relaxes). 
[...] intonation is a cultural 
heritage and so ‘learned’ [...] too close to allow us to deny all 
physiological determination (1949; 1947). Perhaps this is a good way to look at 
intonation at this stage of analysis. 
Prior to [...] too sweeping a conclusion about [...] 
intonation we [...] it imperative [...] with the 
way intonations work in various languages. A [...] 
of the results thus secured will, [...] hope, provide us with a 
more detailed and accurate picture [...] intonation as it is used in human 
speech. 

Used in conjunction with an [...] utterance, intonation modifies in a delicate 
manner the ‘contents’ ([...] values) of that utterance or even 
supersedes them. It is worth [...] that of late intonation has been 
given more attention than ever before for the analysis of sentence types. It 
is to be noted that the aim of an utterance is not always attained by uni- 
lateral means. The genuine purport of language for communication is ex- 
pressed by a combination of a specific choice of lexical items (words), their 
proper arrangement (syntax), and sound. As an agglutinative language, 
Japanese possesses a large number of what may be called enclitic elements 
(particles being the most typical of them) – and some of these elements 
make intelligible the grammatical construction of an utterance, while others 
impart an emotional coloring to the expression. In fact, the latter are, in the 
minds of ordinary persons, a sort of ‘visible’ intonation. It may therefore 
be argued that, in some instances, the intonation would be allowed with- 
out much damage to meaning to play second fiddle so long as an adequate 
selection of words is made. Ryuuzaburoo Taguchi has observed that 
English contains varieties of spoken forms beyond mere words — and these 
are capable of expressing by vocal means diverse feelings like joy and anger. 
On the other hand, we have in Japanese such a vast field of emotional ex- 
pression relying on ‘phonogramic’ words that we could, if we so desired, 
dispense with other devices of expression and might still be quite able to 
meet our communicative purpose. In effect, he appears to assume that 
in English emotional expressions are frequently formulated by cumulative 
means, while in Japanese they are mostly formulated by linear devices. 
Motoki Tokieda states, in effect, that ‘if we had not developed in this 
language various types of word forms to indicate sentence structures, 
we would have developed other means of doing so – e.g. intonation’ 
(1950, p. 357). He says we have a linear composition in Darekakitaka? (ka 
is a question particle, and we have a level low intonation at the end), but 
that we have a cumulative composition in Darekakita? (a question here 
signaled by a rising intonation). There appears to be much to be said for 
these statements as far as actual practice goes. It is true that Japanese 
is shackled with varieties of emotional and connective elements, and 
that we actually make conscious or unconscious use of them to suit 
specific needs. Unless actually pronounced, it would not be possible to 
determine, without a proper context to supplement its meaning, whether 
the expression Darekakita is a question (‘Did somebody come?’) or a 
statement (‘Somebody came.’). The addition of the particle ka usually turns 
this into a question, with its accompanying falling intonation. (Compare 
this with Russian utterances containing the particle li.) However, it is 
equally true that Darekakitaka could be pronounced with a rising 
intonation as a normal question of this type. So it may safely be assumed that 
intonation, if it is employed at all, has a value of its own as a psychological 
pitch curve. This seems to be especially true of the English language. For 
example, the utterance Is she happy? is an interrogative sentence, [...] 
different intonations [...]. Hans Kurath distinguishes (1) syntactical 
intonation which expresses the syntactical relation between phrases, and 
(2) emotional intonation which expresses the feeling or attitude of the 
speaker toward the idea expressed or the person addressed (1930). His 
remark that as a rule emotional and syntactical intonation do not clash, 
but that emotional intonation may run counter to the normal syntactical 
intonation and reverse it appears to have some bearing on the question 
now under consideration. Here we recall a remark of Gardiner's that, 
[...] elocutional form (intonation) predominates over the locutional form 
[...] and that elocutional form provides the dominant clue to the special 
[...] of a sentence (1932, p. 201). 
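
The interplay between the particle ka (a linear device) and intonation (a cumulative one) in the Darekakita examples can be summarized as a small decision function. This is only a sketch of the pattern described, under the simplifying assumption that intonation is reduced to the two labels `"rising"` and `"falling"`; the function name `sentence_type` is hypothetical, and the ‘usually’ in the text is not modelled.

```python
def sentence_type(has_ka, intonation):
    """Classify Darekakita(ka) as 'question' or 'statement'.

    has_ka:     True when the question particle ka is present.
    intonation: 'rising' or 'falling' (a simplifying two-way assumption).
    """
    if intonation not in ("rising", "falling"):
        raise ValueError("intonation must be 'rising' or 'falling'")
    if has_ka:
        # With ka the utterance is a question under either intonation.
        return "question"
    # Without ka, intonation alone carries the distinction.
    return "question" if intonation == "rising" else "statement"


for has_ka in (True, False):
    for intonation in ("rising", "falling"):
        form = "Darekakitaka" if has_ka else "Darekakita"
        print(form, intonation, "->", sentence_type(has_ka, intonation))
```

Only the form without ka depends on intonation for its sentence type, which is the sense in which intonation there does the work that the particle does elsewhere.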
Now we shall list some of the intonational patterns – if patterns they 
are – of some of the typical Japanese expressions. We restrict the examples 
to [...] short ones to simplify the description. Then we shall 
compare their intonational patterns with those of their equivalents – or we 
should rather say ‘near’ equivalents – in English. Sumisusanwa senseedesu. 
‘Mr Smith is a teacher.’ / Tarooga kita. ‘Taroo came.’ / A to B too tasuto 
C ni naru. ‘A and B make C.’ / Moojiki haruni naru. ‘Spring will come 
round soon.’ / Amari ookuwa arimasen. ‘There is not much of it.’ In 
ordinary expressions like these (i.e. ‘colorless’ statements made in a 
matter-of-fact way), the speaker usually finishes the utterance with a falling 
intonation. He simply has said what he has to say – no more, no less – and 
there naturally occurs a psychological pause sentence-finally, and 
physiologically, there comes relaxation, too. Nor would any particular implication 
[...] to the whole expression. This is exactly what characterizes 
[...] in English. (Try this intonation on the above translations.) 

The above examples would be given subtle emotional colorings by the 
addition of words (mostly particles), and this is what usually happens in 
[...] conversation. If a rising or a raised pitch is employed in such instances, 
probability is that the speaker is appealing (strongly) to the hearer or calling 
for the [...] ‘participation’ in or sympathy with his view or statement. Thus: 
A to B too tasuto C ni narimasune would be a reassuring remark or even a 
half question if pronounced with a rising intonation. Similarly: Moojiki 
haruni naruyo. 

Again the rising intonation lacks that connotation of finality which is 
expressed by a fall in pitch. E.g. Tabun. ‘Probably.’ / Iie. ‘No.’ 
We are capable of expressing a variety of implications in English merely 
by having rising intonations superimposed: e.g. I'm not “doing anything. 
(courteous refutation) / [...] (careful, deliberate statement) / These 


things sometimes ‘happen, you "know. (reassuring statement) / You're "looking 
for the *money, I sup'pose. (doubtful statement). 

Some of the above-mentioned examples may be changed to questions 
merely by the superimposition of a rising intonation. Tarooga kita?
‘Taroo came?’ / Moosugu haruni naru? ‘Spring will come round soon?’ /
Tabun? ‘Did you say “probably”?’ It appears that the formation of
‘questions’ by such means is often resorted to in various languages: e.g.
Russian vy student (statement) and vy student? (question). I. C. Ward
observes a similar phenomenon in the Yoruba language, too. Note that if
the question is asked as a final alternative, one would not necessarily raise 
one's voice: e.g. ‘Is it a banana?’ ‘No.’ ‘Is it a pear?’ ‘No.’ Dewa ringo 
(deshoo)? (‘Then, it's an apple, isn't it?’) 

The question particle ka is often used for interrogation in general and 
Special questions, either with a rising or a falling intonation, but some- 
times with different implications. In informal speech this ka is frequently 
dispensed with. Edwards’s observation that interrogative sentences which
do not contain interrogative words or ka are very rare is not true of the
current usage. Compare the following examples: (‘I can’t tell a flatfish
from a sole. Tell me . . .’) Korega hirame desuka? (rise or sustained pitch),
‘Is this a flatfish?’ / (‘Now I see the difference between a flatfish and a sole.
So . . .’) Korega hirame desuka! (fall) ‘This is a flatfish, isn't it?’ The
English pattern Is "that ‘so? is an ordinary question similar to It /is?,
while Is ‘that ‘so? would rarely be used for disagreement; it rather shows,
almost invariably, astonishment but willingness to agree, as would You
‘don’t ‘say? under the same circumstances. (The first example has a rise;
the second falls.) 

With reference to ‘command-request’ expressions, similar phenomena
appear to be observable in Japanese and English intonation. The falling
intonation would range from a frank informal request to even a brusque
command; while the rising intonation would sound less informal, or would
often afford a feeling of courteous request. However, too much
‘appealingness’ on the part of the speaker would work the other way round. It
would then become too importunate a request, and would, in such an instance,
sound hardly cordial. Try this on Ohairinasai. ‘Please come in.’ This may
be pronounced with either a falling or a rising intonation.² It would seem


2. If the expression Ohairinasai is pronounced with a falling intonation, the ‘fall’
following the ‘accented’ element occurs between sa and i, and the final i carries the
voice further down to the bottom (falling intonation). If a rising intonation is
superimposed upon this identical expression, the voice glides up from sa to i, instead of
rising from i, as might be expected. (This is an example of temporary tonal
disturbance.) The reason may be that here sai is treated as a monosyllable. If Ohairinasai
is an echo-question, we notice a fall between sa and i before the voice rises
sentence-finally.


342 Universality 




that a genuine command that calls for immediate obedience is invariably
pronounced with a falling intonation. (A bark would hardly be accompanied
by an appeasing tone!) Compare Kaere (or Kaereyo) ‘Go back’ or
Koi. ‘Come here’. (fall) with Kiotsukete okaeri. (Lit.) ‘Be careful and go
back’ [May be said to a child on parting] (fall or, more often, rise). Similar
usage appears to obtain in English expressions of this type. The present
writer has elsewhere dealt in some detail with English patterns (Abe, 1954).
To cite a few examples: ‘Give me the ‘knife. / Think what you are ‘saying. /
Don't be so particular. (fall) / Please bring me the ‘water, Tom. / Do sit
down. / ‘Don't trouble to ‘answer it. / Don't ‘worry. (rise).

Finally a word about intonational patterns of exclamatory expressions.
The term ‘exclamatory’ is vague and loosely defined; but here we merely
follow the conventional classification. We limit ourselves to a citation of a
few examples: Nanto kiree-nandaroo! ‘How lovely!’ / Maa! ‘Oh dear!’
(fall). It will be observed that the English expressions like the above end
with a falling pitch, too.

We shall now pass to the final and non-final aspects of Japanese
intonation.
In an unemphatic ‘statement’ the voice usually goes down to the
bottom of the speaker's pitch range as he concludes the statement. However,
there is a point that must be borne in mind here: e.g. ([...] started
[...]) Ame. ‘Rain.’ / (‘[...] what do you like to eat?’) Ame. ‘Candy.’ The
first example offers no knotty problem, since the tone of ame (as we have
observed) follows the direction of the falling intonation. In the second
example, the tone of ame is the exact reverse of the
falling intonation. [...] on the other hand, [...] it is not customary to
terminate [...] In Japanese [...] rising [...] upon it a too decided falling
intonation. (Too much [...] would indeed cause the [...] exceptionally
emphatic.) Instrumentally, there [...] immediately following the element me,
but perceptually [...] conspicuous [...] is that of a sustained pitch [...] the
rise indicative of a rising [...] (‘Candy’) has a falling intonation
sentence-[...] necessarily correspond to the expected [...] final fall. In this
particular respect, Japanese appears to resemble Norwegian (see pp. 432-3).

[...] (or raised) pitch or a rising pitch [...] sentence-medial position [...]
employed to signal non-finality and/or [...] – be it the type often heard in
narrative reading style or [...] passionate [...]

3. Likewise, the utterance would end in a suspended [...] (in a statement) if
the final vowel [...] But if this vowel happens to retain its full vowel
quality, as in care[...], we have the regular fall (low) on it.


device (i.e. suspended level pitch) appears to be more prevalent either as an
ordinary ‘suspensive’ pattern or as a ‘hesitation’ contour. Unless some
manneristic ‘particles’ of an emotional nature are used phrase-finally, a
rising pattern is apparently not used in this position so frequently.
Mimashitaga kiniirimasendeshita. ‘I saw it, but I didn't like it.’ / Sotonidete
sampoo shimashitayo. ‘I went out, and took a walk.’

We have, of course, a rising or a raised pitch occurring sentence medially.
This ‘positive’ intonational pattern contrasts with the ‘negative’ one just
mentioned. In fact, this intonation is actually used when the speaker
wishes particularly to call for the auditor’s attention, or to imply that
something is still to come, or even to indicate the location of emphasis.
This pattern is typical of a pompous, oratorical style – a pattern by means
of which the speaker usually endeavors to influence his audience. It is also
suggestive of an ‘advertising’ voice. This positive intonation is particularly
noteworthy in such a case as the following: Arutokoroni onnanokoga
sundeimashita. (Lit.) ‘Certain-place-at [rise] girl [rise] living-was’ = ‘There
lived at a certain place a little girl.’ This example may be matched with
English ‘Arthur ‘stood and ‘watched them ‘hurry away. This example is
from Mrs Uldall’s work. She says that rising unstressed syllables add
surprise or interest to statements on the falling tune and calls the above
intonation ‘fairy-tale’ intonation (Uldall, 1939). In English we see that
either a suspended or a rising intonation occurs with slightly different 
implications. We even have a complete fall. For detailed accounts of this 
point see, for example, Pike’s analysis of various contours (e.g. 2-4, 2-3, 
2-3-2, 2-4-3) in American English (Pike, 1949). 

We shall finally attempt to compare, in some details, the intonation of 
the so-called Special Questions in English (i.e. questions beginning with 
interrogative words such as what, when, why, who, etc.) and their (near) 
equivalents in Japanese. This would enable us to have a fairly good 
panoramic view of some of the typical Japanese intonational patterns and 
their implications. 

First, there exists what might conveniently be termed a gradually falling
intonation. This characterizes normal questions of the information-seeking
type: Itsu yukuno? ‘When are you ‘going?’ / Dokoni atta? ‘Where was that?’. It
will be noticed that both Japanese and English employ a similar pattern on
such occasions. Sweet remarks that ‘questions which are begun with an
interrogative word have the falling tone because they can be regarded as
commands.’ Elsewhere, he says: ‘The brevity and imperativeness of special
interrogative sentences such as what is his name? is often avoided by
substituting a longer general interrogative form: can you tell me what his
name is?’ (Sweet, 1890, p. 32; 1922, pt 2, p. 39). This falling intonation is
often quite perfunctory, and will be taken as such. It is interesting that




similar patterns are employed in many other languages: e.g. Waar, kind?
/ ¿Quién ha venido? / Wann soll ich kommen? / Quelle heure avez-vous? / Gde oni?

If a slight rise, instead of a fall, is added to the utterance, it would
impart an effect of curiosity or cordiality. Try this on the examples just
given. And compare these with English How's your ‘mother? / What's
the ‘time? where we note not infrequently a rising intonation. This rise
would sound, depending upon the situation in which it is used, either
pleading, wheedling, or even importunate. We often hear people say Doo?
‘How do you like this?’ / Daare? ‘Who's this?’ (May be said to a visitor
at the door who cannot be identified).
A heightened tune is suggestive of intensified feelings – animation, anger,
irony, exultation, and what not. Superimposed upon an interrogative
word – upon the ‘accented’ element which is as often as not elongated to
carry the intonation – such a raised or rising pitch gives, in consequence,
what we might term a convex or ‘surgy’ intonation. Na(a)nio
miteirundesu(ka)? ‘Whatever are you looking at?’ / Do(o)koni itteta? ‘Where on
earth have you been?’ / Na(a)ze naiteiruno? ‘What on earth are you crying
for?’ Either a rising or a falling intonation may be added to these
utterances sentence-finally. The general effect given is that of curiosity, the tail
rise imparting a greater degree of that feeling. Cf. ‘Who could ‘that ‘be?
If the interrogative word is pronounced with a rapid decrescendo of
voice – that presupposes the existence of a high pitch and stress – and with
a falling sentence-final intonation, a note of accusation would be
introduced into the utterance. Nandesuka! ‘What a thing to say!’ / Nanio
miterundesu(ka)? (Possible implication:) ‘What are you looking at? Don't look
[...] Off!’ Mori remarks that Nanio-suruka may be uttered in a falling tone
as a threat. I presume that Nanio will be heavily stressed. Referring to the
example Who wrote this (where Who is high and wrote this is low), Bush
observes that if an American speaks this way, he is expressing disapproval
of whoever wrote it (Bush, 1952).
A [...] rise of pitch at the end would indicate that the speaker
is surprised or is highly incredulous. Utterances upon which such an
intonation is superimposed would range from a mere request for repetition
of the preceding remark, either in its entirety or in part, to a kind of retort
or challenge. In other words, this pattern is likely to become rhetorical in
nature.

Nanio watashiga shiteru? ‘What am I doing?’ / [...] ‘What?’
Compare (‘Hanako's coming.’) Dare? ‘Who?’ (rise) (‘Who did you say
was coming?’) with (‘Somebody’s coming.’) Dare? ([...] rise) = ‘Tell me
who this somebody is.’ It appears [...] corresponds roughly to [...] contours.

We shall, by way of experiment, try some of these intonations on the
utterance Nanio yatteruno? (‘What are you doing?’). [...]
normal colorless question asking for a piece of information. [...]
at the end would make this expression sound cordial – or it [...]
that the speaker is curious. A ‘gentle’ convex intonation on [...]
Naanio would turn the expression into an intensely curious question. The
sentence-final intonation may be either falling or rising, with a slight
difference in the degree of interest shown by the speaker. Heavily stressed
na plus a rapid glide down to the low pitch of nio and the [...]
would suggest that the speaker has lost his temper. If a convex intonation
(actually a raised pitch) is superimposed on the element -ter- (with the
result that the peak in the group yatteruno is higher than the peak in the
group nanio) the expression might be taken as a sarcastic comment on the
behavior of the person addressed. Compare this last example with English
‘What are you do‘ing? mentioned by Bolinger (1948).

The above observations are confined to a few typical expressions, and I
conjecture that other intonational patterns – and consequently other
interpretations – might be possible, too. I hope this brief sketch of mine is
free from too subjective impressions. A more comprehensive study
yet remains to be attempted of Japanese intonational patterns which are
likely to sound UN-English. This brief sketch is a tentative attempt to
verify to what extent intonations – that is, psychological pitch curves as
they are usually called – may be declared international. And limited as
this article is in scope, there would appear in many points rather striking
similarities in the way intonation curves are employed in English and
Japanese. (This gives a promising hint for our further studies.) This does
not, of course, imply that parallel expressions in these languages would
sound quite alike if actually pronounced. I don't think they would; details
differ, and various other linguistic factors add to melodic composition as a
whole. The present writer merely wishes to emphasize that the psychological
channeling of voice in English and Japanese seems to have much in
common – particularly with reference to such crucial points as ‘question’
and ‘statement’ tunes. And these, we presume, constitute the very kernel
of human speech.


References

ABE, I. (1954), ‘Intonation of “request-command” expressions in English’, Bull.
Phonetic Society of Japan, no. 85.
ARISAKA, H. (1940), The Theory of Phonology.
BALLY, C. (1935), Le Langage et la vie, Romanica Helvetica, vol. 1, Zurich,
M. Niehans.
BOLINGER, D. L. (1947), ‘Comments on Pike's American English intonation’,
Studies in Linguistics, vol. 5, no. 3, pp. 69-78.
BOLINGER, D. L. (1948), ‘Intonation of accosting questions’, English Studies, vol.
29, no. 4, pp. 109-14.
BOLINGER, D. L. (1949), ‘Intonation and analysis’, Word, vol. 5, no. 3, pp. 248-54.
BUSH, H. C. (1952), ‘Connotations of the stressed interrogatives in English’,
The Rising Generation, vol. 48, no. 2.
CHAO, Y. R. (1933), ‘Tone and intonation in Chinese’, Bull. National Research
Institute of History and Philosophy.
GARDINER, A. H. (1932), The Theory of Speech and Language, Clarendon Press.
HATTORI, S. (1954), ‘Japanese accent from a phonemic point of view’, Inquiries
into the Japanese Language, no. 2.
JIMBO, K. (1925), ‘The word tone of the standard Japanese language’, Bull.
School of Oriental Studies, vol. 3, no. 4.
KURATH, H. (1930), ‘A specimen of Ohio speech’, Curme Volume of Linguistic
Studies, Waverly Press.
MARTIN, S. E. (1952), ‘Morphophonemics of standard colloquial Japanese’,
Language, Supplement to Language Dissertation, no. 97.
MIYATA, K. (1927), ‘My view of Japanese accent’, Study of Sounds, nos 1, 2 and [...]
PIKE, K. L. (1949), The Intonation of American English, University of Michigan
Press, 2nd edn.
SWEET, H. (1890), A Primer of Spoken English, Clarendon Press.
SWEET, H. (1922), A New English Grammar, Clarendon Press.
TERAKAWA, K. (1945), A Study of [...]
[...] of Japanese Philology.
ULDALL, E. T. (1939), ‘The intonation of American English’, unpublished thesis.


20 Kerstin Hadding and Michael Studdert-Kennedy


An Experimental Study of Some Intonation Contours


Kerstin Hadding and Michael Studdert-Kennedy, ‘An experimental study of some
intonation contours’, Phonetica, vol. 11, 1964, pp. 175-85, published by S. Karger,
Basel.


Introduction 


Questions¹ are often said to be distinguished from statements by a terminal
rise in fundamental frequency (f₀) as against a terminal fall. However,
questions may also be distinguished by a comparatively high f₀ throughout
the utterance (Hermann, 1942). Spectrographic analyses of Swedish speech
have shown that, in this language, questions tend to be spoken on a higher
f₀ than statements, usually ending in a moderate rise (Hadding-Koch,
1961).


In the description we postulate four levels, from 1 (lowest) to 4
(highest).² Thus the sequence 3 42↑3 [...] of a Swedish question. The arrow
[...] the point where the terminal glide begins. [...] Swedish question;
2 31↓ or [...]

Similarly, a typical American-English question is said to display a
continuously rising contour that may be notated, for example, 2 23↑ or [...]
(Pike, 1945; Bronstein, 1960). A typical American-English statement may
be notated exactly as in Swedish, 2 31↓.
However, polite statements in Swedish, though spoken on a lower
frequency level than questions, quite often end with a rise. In American
English also, terminal rises are reported to occur in statements (Uldall,

1. ‘Question’ refers throughout to so-called Yes-No questions.
2. The acoustic correlates of intonation are said to be changes in one or more of
three variables: fundamental frequency, intensity, duration, with fundamental
frequency being the strongest single cue (Bolinger, 1958; Denes, 1959; Denes and
Milton-Williams, 1962). The present study is concerned with only one of these variables,
fundamental frequency, and the term ‘intonation contour’ refers to contours of
fundamental frequency.


1962). In fact, Uldall, using synthetic speech, demonstrated that an
utterance could have quite a large terminal rise and still be heard as a
statement, if the rise was preceded by a high fall. If there was no pitch
higher than the end point of the terminal rise, the utterance tended to be
heard as a question.

These facts concerning both Swedish and American English suggest that
not only the direction and range of the terminal glide, but the shape and
level of the entire contour affect listeners’ judgements (Gårding and
Abramson, 1960; Hadding-Koch, 1961). The present experiment was
designed to explore this notion in more detail by means of synthetic
intonation contours, and to compare, for Swedish and American listeners, their
preferred question and statement contours. In addition, as a partial check
on the degree to which listeners could actually hear the detailed tonal
movements involved, some purely psychophysical data were collected.


Method 

The utterance För Jane [foe ^Jein] = for Jane, spoken on a monotone and
in such a way as to be acceptable as Swedish to Swedes, as American to
Americans, was recorded on magnetic tape. From this recording forty-two
different fundamental frequency contours, simulating Swedish intonation,
were prepared by a procedure described below. The f₀ values were based
on detailed spectrographic analyses of a long sample of the Swedish
speaker's natural speech. The correspondences between level notation and
fundamental frequency derived from this analysis are given in Table 1.


Table 1 Correspondences between level notation and fundamental
frequency in hertz (from Hadding-Koch, 1961)

Level    Fundamental frequency in hertz
4        370 and above
3        260-370
2        175-260
1        175 and below
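
The correspondences in Table 1 amount to a simple lookup from frequency to level. The following is a minimal sketch, not part of the original study: it assumes that the four levels are numbered from 1 (lowest) to 4 (highest) and that the four frequency bands are listed from the highest level down. The function name `hz_to_level` is hypothetical. Because adjacent bands share their boundary values, the treatment of exactly 175, 260 and 370 Hz is also an assumption, marked in the comments.

```python
def hz_to_level(f0_hz: float) -> int:
    """Map a fundamental frequency in hertz to a pitch level (1 = lowest, 4 = highest)."""
    if f0_hz <= 0:
        raise ValueError("fundamental frequency must be positive")
    if f0_hz >= 370:   # "370 and above"
        return 4
    if f0_hz >= 260:   # "260-370"; exactly 260 Hz is placed here by assumption
        return 3
    if f0_hz > 175:    # "175-260"
        return 2
    return 1           # "175 and below"; exactly 175 Hz is placed here by assumption


# Start, peak and turning point of two contours used in the experiment
print([hz_to_level(f) for f in (250, 310, 130)])  # [2, 3, 1]
print([hz_to_level(f) for f in (250, 370, 220)])  # [2, 4, 2]
```

Under these assumptions a contour starting at 250 Hz, peaking at 310 Hz and turning at 130 Hz is rendered 2 3 1, and one peaking at 370 Hz and turning at 220 Hz is rendered 2 4 2.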




[...] contour, 2 42↑, and statement contour, 2 31↓, were used. In the
present experiment the first number represents the level of the pre-contour
För, the second number the level of the ‘peak’, the third number the level
of the ‘turning point’ (Gårding, 1960) before the terminal glide. Between
the poles of ideal question and statement, various f₀ values at peak,
turning point and end point were introduced. Diagrams of the contours are
reproduced above Figures 1 and 2. All contours started at a fundamental
frequency of 250 Hz, sustained for 140 ms over För. They then rose to a
peak of either 370 Hz (the S, or superhigh, series of contours) or 310 Hz
(the H, or high, series), dropped to one of three turning points: 130 Hz
(S1 and H1 series), 175 Hz (S2 and H2 series), or 220 Hz (S3 and H3
series), and then proceeded to one of seven end points between 130 Hz and
370 Hz. The rise and fall on either side of the peak lasted for 300 ms; the
terminal rise or fall, from turning point to end point, lasted 200 ms. The
actual contours were rounded at peak and turning point rather than pointed
as in the schematic contours above the figures.
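
The design just described (two peaks, three turning points, seven end points) can be enumerated mechanically, which also shows where the figure of forty-two contours comes from. The sketch below is only an illustration. The start, peak and turning-point values are those given in the text, but the text states only the range (130-370 Hz) and the number (seven) of the end points, so the particular values in `END_POINTS_HZ` are an assumption, and all names are hypothetical.

```python
from itertools import product

START_HZ = 250                                 # level sustained over the pre-contour
PEAKS_HZ = {"S": 370, "H": 310}                # superhigh and high series
TURNING_POINTS_HZ = {1: 130, 2: 175, 3: 220}   # series suffix -> turning point
# ASSUMPTION: only the range and the count of end points are stated in the text.
END_POINTS_HZ = (130, 145, 175, 220, 275, 310, 370)


def build_contours(end_points=END_POINTS_HZ):
    """Return one record per combination of peak, turning point and end point."""
    contours = []
    for (series, peak), (suffix, turn), end in product(
        PEAKS_HZ.items(), TURNING_POINTS_HZ.items(), end_points
    ):
        contours.append({
            "series": f"{series}{suffix}",
            "start": START_HZ,
            "peak": peak,
            "turn": turn,
            "end": end,
            # end point minus turning point: positive = rise, negative = fall
            "glide": end - turn,
        })
    return contours


contours = build_contours()
print(len(contours))  # 42

h1_falls = [c for c in contours if c["series"] == "H1" and c["glide"] < 0]
print(len(h1_falls))  # 0
```

The second check reflects a property of the design itself rather than of the assumed values: since 130 Hz is both the lowest turning point and the lowest end point, no contour in a series with that turning point can end in a terminal fall.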
The intonation was varied by means of the Intonator connected with
the Vocoder at Haskins Laboratories, New York. The Vocoder first
analyses a speech sample in a bank of filters and then reconstitutes it in
simplified form on the basis of information obtained from the analysis
(Dudley, 1939; Borst and Cooper, 1957). The fundamental frequency of
the output is controlled by the Intonator, and may be varied independently
of other characteristics of the speech sample. Thus, the same utterance
may be given any desired number of different fundamental frequency
patterns. Instructions to the Intonator are transmitted through
photoelectric tubes responding to light reflected from a contour painted on an
acetate loop. Also attached to the loop is


Figure 1 Two-category semantic judgements (Swedish): percentage of
statement and question responses as a function of the terminal rise
(positive) or fall (negative) in hertz of f₀ (endpoint minus turning point).
Parameters of the curves are turning [...]

Figure 2 Peak f₀ at 310 hertz: percentage of statement and question
responses as a function of the terminal rise (positive) or fall (negative) in
Hz of f₀ (endpoint f₀ minus turning point f₀). Parameters of the curves are
turning points at 130 Hz (H1), 175 Hz (H2) and 220 Hz (H3).

Figure 3 Swedish two-category semantic judgements: percentage of
statement and question responses as a function of the terminal rise
(positive) or fall (negative) in hertz of f₀. Contours with peaks at 370 Hz
(S) and 310 Hz (H), with the turning point constant at 175 hertz, are
compared. The crosses indicate the points of subjective equality for the US
subjects in the S2 (left) and H2 (right) series.


Figure 1 presents the semantic data for the Swedish subjects (above)
[...] on the S series of contours (peak at 370 Hz). [...] of question and
statement responses. Against the abscissa are plotted the values of the
terminal rise or fall in hertz of fundamental frequency (end point minus
turning point): a negative value indicates a terminal fall, a positive value a
terminal rise. Parameters of the curves are peak f₀ (370 Hz) and turning
point f₀ (130 Hz for S1, 175 Hz for S2, 220 Hz for S3).

The effect of the terminal rise or fall is immediately obvious and very
much as expected: for all three series the higher the terminal rise, the
higher the percentage of question responses. Equally obvious is the effect
of the fundamental frequency at the turning point. For purposes of
comparison we may consider the so-called points of subjective equality, that is,
the indifference points at which subjects’ responses cross over from
predominantly statements to predominantly questions. For the Swedish
subjects we find the crossover in the S1 series at a final rise of 120 Hz, in the
S2 series at a final rise of twelve hertz, and in the S3 series at a final fall of
sixty-five hertz. Thus, the f₀ value of the turning point may quite override
the effect of the terminal rise or fall. For example, a terminal fall of
forty-five hertz is heard as a statement 96 per cent of the time when the
turning point is at 175 Hz, but as a question 89 per cent of the time when
the turning point is at 220 Hz. Similar effects are present in the American
data. But the Americans display some preference for statements over
questions. As compared with the Swedes they require somewhat smaller
terminal falls to be sure they hear statements, somewhat larger terminal
rises to be sure they hear questions.
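
A point of subjective equality of the kind discussed above can be estimated from a response curve by finding where the percentage of question responses passes 50 per cent. The sketch below is one simple way of doing this, by linear interpolation between neighbouring stimuli; it is not claimed to be the procedure used in the study, the function name `crossover_point` is hypothetical, and the numbers in the example are invented for illustration and are not the published data. It relies on the judgements being two-category, so that statement and question percentages sum to 100.

```python
def crossover_point(glides_hz, pct_question):
    """Return the terminal glide (Hz) at which question responses reach 50 %.

    Linear interpolation is used between the two neighbouring stimuli whose
    percentages lie on either side of 50. Returns None if there is no crossing.
    """
    points = sorted(zip(glides_hz, pct_question))
    for (x0, y0), (x1, y1) in zip(points, points[1:]):
        if y0 == 50:
            return float(x0)
        if (y0 - 50) * (y1 - 50) < 0:
            return x0 + (50 - y0) * (x1 - x0) / (y1 - y0)
    if points and points[-1][1] == 50:
        return float(points[-1][0])
    return None


# Invented illustration data, NOT the published results
glides = [-45, 0, 45, 100]   # end point minus turning point, in Hz
pct_q = [4, 40, 80, 98]      # percentage of "question" responses
print(crossover_point(glides, pct_q))  # 11.25
```

A positive result corresponds to a crossover at a terminal rise, a negative one to a crossover at a terminal fall, in the sense used in the text.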
In the H series (peak f₀: 310 Hz) the number of questions heard again
increases with the f₀ value at the turning point, although less markedly.
Figure 2 presents the data for the Swedish subjects (above) and the
American subjects (below). Here the groups differ little in their question
curves. But the Swedes display a preference for statements, particularly in
the H2 and H3 series.

[...] turning point – is accompanied by an increase in the number of questions
heard. Figure 3 facilitates this comparison by displaying the Swedish data
for the S2 and H2 series on the same axes. For example, a stimulus with
[...] and induces a virtual reversal of the
response distributions.


Turning, finally, to the results of the psychophysical tests, in which
subjects were asked to indicate whether the contours ended with a rising
or a falling pitch, we find


Figure 4 US two-category semantic and psychophysical judgements:
percentage of statement and question responses (semantic: solid circle or
triangle) and of rise and fall responses (psychophysical: empty circle or
triangle) as a function of terminal rise (positive) or fall (negative) in hertz
of f₀. Data from US subjects on the H1 stimulus series.


marked. For example, Figure 5 displays the American psychophysical and
semantic data from the S3 series. Here, as is generally true, the
psychophysical judgements are more uncertain than the semantic – particularly
for the contours displaying terminal falls. None the less, there is still
remarkable agreement between the two sets of curves.


Discussion 

The results confirm what naturalistic observation and some previous
experiments have already suggested: that listeners may make use of the entire
contour in identifying questions and statements. Not only terminal rise


Figure 5 US two-category semantic and psychophysical judgements:
percentage of statement and question responses (semantic) and of rise and
fall responses (psychophysical: empty circle or triangle) as a function of
terminal rise (positive) or fall (negative) in hertz of f₀. Data from US
subjects [...]


[...] cannot be easily described. But, in general, for a given f₀ at the other
two points, an increase in f₀ at the third point leads to an increase in the
[...]

[...] at 310 Hz and a turning point at 130 [...] even when [...]
statement [...] contour was 2 31↓. Since the turning point f₀ (130 Hz) was the
lowest in the experiment, no final fall could occur with the H1 contour.

As was stated earlier, Swedish and American English are said to have
similar typical statement contours, but different typical question contours.
As to questions, the data of the present experiment do not contradict this.
For, although both groups selected a typical Swedish question (S3, 2 42↑)
as their preferred question contour, the Americans did require a higher
terminal rise to reach complete agreement on their question responses
than the Swedes: lacking the typical continuously rising question of
American English (2 23↑), they gave more weight to the terminal glides than
did the Swedes. However, they also gave more weight to the terminal glides
in the preferred statement series (H1). This suggests that the two groups
may differ in their preferred statement as well as in their preferred question
contours.

Finally, the psychophysical data perhaps throw some light on the
process by which the f₀ values at peak and turning point exert their
influence on listeners’ semantic judgements. These data show that listeners
were unable to follow the terminal glide with anything like the precision
that might have been predicted from simple pure tone pitch discrimination
(Stevens and Davis, 1938): psychophysical judgements were influenced by
peak and turning point f₀ very much as semantic judgements. In so far as
semantic and psychophysical judgements agree (as in Figure 4), it would
seem that listeners may have been using the perceived direction of the
terminal glide rather than its physically measured direction to make their
[...] peak and turning point f₀ [...] of the terminal glide. On the other
hand, in so far as the psychophysical data display greater uncertainty than
the semantic (as in Figure 5), the peak and turning point f₀ values would
seem to exert an independent influence on the semantic judgement,
presumably in some weighted combination with the perceived terminal glide.³

3. [...] in Status Report on Speech Research, Haskins Laboratories, New York, [...]


References { 
Oliy itch accent in English’, Word, vol. 14, 
bp, 102% D. L. (1958), “А theory of pi 

Borg RD arch devices based on a 


nh «speech тезе! 

“ашы ан (1950). "P Amera vol. 29, р. 777. 
Vocoder (abstract), J- Acoust. 

а Michael Studdert-Kennedy 357 


Kerstin Hadding ап 


TEATSE 


BRONSTEIN, A. J. (1960), The Pronunciation of American English: An 
Introduction to Phonetics, Appleton-Century-Crofts. А А и" 
Denes, P. (1959), “А preliminary investigation of certain aspects of intonation’, 
uage and Speech, vol. 2, pp. 106-22. ТЕЛО T» 
ek Р, апа EEN J. (1962), * Further studies in intonation a 
uage and Speech, vol. 5, p. 1-14. 
Босана (1939), КЫ ре *, J. Acoust. Soc. Amer., vol. 11, рр. 169-75. 
GARDING, E. (1960), ‘A study of the perception of some American English 
intonation contours', Paper read before 75th Meeting Mod. Lang. Ass. Amer., 
Philadelphia, 28 December. 
GARDING, E., and ABRAMSON, А, S. (1960), 
American English intonation contours’ 
Report no. 34, New York, June. 
HADDING-Kocu, К. (1961), Acoustico-Phonetic Studies in the Intonation of 
Southern Swedish, Gleerups, Lund. 
HERMANN, E, (1942), 


Probleme der Frage, Nachrichten von der Akademie der 
Wissenschaften in Góttingen, vol. 3-4, 


Pike, K. L. (1945), The Intonation of American English, University of Michigan 
Press. 


"A study of the perception of some 
» Haskins Labs, Quarterly Progress 


STEVENS, S. S., and Davis, Н. (1938), Hearing, its Psychology and Physiology, 
Wiley. 


ULDALL, E, T. (1960), * Attitudinal meanings conveyed by intonation contours’, 
Language and Speech, vol. 3, pp. 223-34, 


ULDALL, E. T: (1962), ‘Ambiguity: 


ү Question or statement? ог “Are you asking me 
or telling me?" Proc, [y Int. Со; 


ngr. Phon. Sciences, pp. 779-83, Mouton. 




21 Marguerite Chapallaz 


Notes on the Intonation of Questions in Italian


Marguerite Chapallaz, 'Notes on the intonation of questions in Italian', from
In Honour of Daniel Jones, edited by David Abercrombie, D. B. Fry, P. A. D.
MacCarthy, N. C. Scott and J. L. M. Trim. Longman, 1964, pp. 306-12.


Short interrogative sentences have the intonation of one or other of two
basic intonation patterns of Italian, viz. the falling and the falling-rising,
referred to respectively as Basic Pattern 1 (BP 1) and Basic Pattern 2 (BP 2).


Basic pattern 1

This is similar to Tune 1 of English described by Armstrong and Ward
(1950, pp. 4, 19-20),¹ except that the last stressed syllable has only a very
small fall of pitch to a low level. If unstressed syllables, or a short group
of a parenthetical nature follow, these syllables are on a low level pitch.
Initial unstressed syllables form an ascending scale going up to the
first stressed syllable.

BP 1 is the most usual pattern for X-questions, that is, questions
beginning with a specific interrogative word, in their simplest form; as, for
instance in:

Come ha fatto?
Quando ci rivedremo?
Chi dovrei annunziare?
Quali lezioni avete avuto oggi?
'Dove vai?' gli chiese il suo amico.


1. Tune 1 is defined: 'The stressed syllables form a descending scale. Within the last
stressed syllable the pitch of the voice falls to a low level' (p. 4).


Basic pattern 2 


Similarly, Italian BP 2 is like English Tune 2 described by Armstrong and
Ward.² It is the common pattern for Yes-No questions, that is,
questions expecting the answer 'Yes' or 'No', as in the questions:

È permesso?
Ti occorre niente?
Non ti andava il lavoro?

There is, however, a great deal of variety in the treatment of the final
part of a BP 2 question group, especially when the final word has penultimate
stress. I have noted the following examples:

1. The last stressed syllable may be on low level pitch with a rise of pitch
in the following unstressed syllable, as in:

Io le ho dato delle illusioni?

2. There may be a fall of pitch in the last stressed syllable and a rise in the
following unstressed syllable, as in:

L'avete trovato?
Facciamo il bagno?

3. The rise in pitch may be spread over the two syllables, as in:

Ho indovinato?

4. When it is the final syllable which is stressed, there may be a fall-rise
within that syllable, thus: [...]

5. A high pitch for the final syllable in the group creates the impression of
heightened curiosity. Thus:

La proviamo?
Pensi di poter giocare?

A parenthetical group following a BP 2 question, as for instance in
reported speech, often has a BP 1 intonation, but with a narrower pitch
range than that of the main group. An example is:

'Sei contento?' disse la mamma.


2. Defined: 'The outline of [Tune 1] is followed until the last stressed syllable . . .
This is on a low note, and any syllables that follow, rise from this point' (p. 20).


Longer groups 

In the gradually descending scale of syllables in a long BP 1 or BP 2
question, one, or sometimes more than one stressed syllable, is pronounced
higher than the preceding syllable, the descent continuing after
this raised syllable as before. There is thus a break in the gradual descent,
the raised syllable forming a 'peak'. (In the text of the examples below an
arrow shows the raised syllable.)

A che dis↑tanza è Londra? chiese.
È questo lo spor↑tello per i telegrammi?
Io la ↑devo intervistare?


X-questions with basic pattern 2

Where an answer is courteously requested rather than insisted upon, an
X-question is commonly spoken with BP 2, as in these examples:

Quando potete farlo?
Da che paese viene?


Yes-no questions with basic pattern 1

Italian has no special grammatical written forms corresponding to est-ce
que or to the inversion of the subject in French, nor to the use of anomalous
finites³ in question forms in English; so that a yes-no question must be
spoken with BP 2 or it will have the intonation as well as the grammatical
form of a statement.

The following examples with BP 2 are yes-no questions:

Non è vero?
Io la ↑devo intervistare?

With BP 1 the same examples are turned into statements, thus:

Non è vero.
Io la ↑devo intervistare.

Under certain circumstances, however, yes-no questions can be heard
spoken with BP 1. This is when either the context is sufficient to indicate
that the group is an interrogative one, or a short phrase or 'tag' preceding
or following the main group gives the clue. Examples:

A rivederci stasera, | eh?


Alternative questions

Basic Pattern 2 is general for alternative questions, as in the examples
below, but if these are spoken in a more peremptory manner then BP 1 may
be heard.

Preferisce il dolce o la frutta?


3. The forms do, did, as in He went, Did he go?


A common modification in BP 1 and BP 2

A certain degree of liveliness and interest is added to a question if there is a
wide pitch interval between the last stressed syllable and the syllable which
precedes it and which is at the same time higher than any other syllable in
the group. Thus:

Che cos'è?
Dove vai?
L'hai portata tu?


Emphatic intonation

Apart from the modifications to BP 1 and BP 2 already mentioned in the
preceding notes and which add a certain degree of liveliness or interest,
ways of giving extra emphasis to questions resemble those used in English.
To illustrate this I conclude with a few examples of:

1. Emphasis for intensity, where the pitch range is widened and the stressed
syllables are pronounced with increased stress. (In the text the marks ' show
intensity stress and " contrast stress.) Thus:

Cosa me ne im"porta?
Cosa in'tendi "dire?
C'era qualcuno?

2. Emphasis for contrast, where the pitch of the stressed syllable of the
contrast word falls from a high to a low note as in:

Che "gridi? È la "donna che piange?
Ma "Lei sarebbe il giornalista locale, vero?


Reference

ARMSTRONG, L. E., and WARD, I. C. (1950), A Handbook of English Intonation,
Cambridge University Press.


Part Seven 
Perturbations 


The study of intonation is complicated by the fact that many things
that happen when we speak cause changes in pitch that may be
irrelevant or at least unrelated to intonation. For example, when we
hold a particular intonation level are there nevertheless irregular
ups and downs associated with individual vowels and consonants?
In the first Reading, Lehiste and Peterson demonstrate that such
fluctuations not only occur but are rather highly predictable.
Vowels differ in their tendencies to raise or lower pitch, and so do
consonants. A further question that always comes up when the
melody of speech is compared with the melody of song is how exact
the relationship is. Two specific answers are given here on whether exact
intervals are used, as in music, and, if not, whether all speakers at least
make leaps in pitch of approximately the same size when they
produce an intonation pattern. (The reader unaccustomed to phonetic
terms may find it helpful to remember that 'syllable nucleus' is, in
effect, the vowel, or diphthong, contained in a syllable.)
[...] of the perturbations noted [...] with intonation. Here [...]
with vowels. If intonation is a melodic line, how is it
possible to tell one contour from another (a question from a
statement, for example) when someone whispers? If there is no voice,
there is no fundamental pitch and no melody. Nevertheless listeners
can detect intonation contours in whispered speech, and apparently do
so by cues unconsciously supplied by the whisperer, which include
certain distortions of the vowel sounds. Meyer-Eppler's study deals
with German, but the same phenomenon can be observed in English.

The sharpest clash of all occurs when the pitch of the voice is
required to convey distinctive tone as well as intonation. Tone
languages use contrasting levels or movements of pitch as a distinctive
feature of syllables, just as they use consonant and vowel contrasts for
the same purpose: the two syllables /ba/ and /ma/ are different by
reason of their consonants; the two syllables /bá/ and /bà/ differ by
reason of the pitch of the vowel, the first high and the second low — 
and this serves as well as anything else to distinguish one word from 
another. But when languages using pitch in this way also find it 
necessary to add another layer of pitch to show moods and 
attitudes (as most if not all do), certain adjustments are necessary. 
The best known of all tone languages is Chinese, because of its long 
cultural history. In her study of the Chengtu dialect, the third Reading 
in this section, Nien-Chuang T. Chang shows first the interaction 
among the tones themselves – the changes or ‘tone sandhi’ that occur 
when two or more are combined – and then the changes, now referred 
to as ‘perturbations’, that occur on just one syllable, the final one, 
to distinguish what is essentially a rising tune from a falling one.

Languages that distinguish words from one another by differences
in tone are not all outside the Indo-European family that includes
English. Some, notably the languages of Scandinavia, are part of that
family and indeed closely related to English. It has been debated
whether Norwegian, for example, should be called a 'tone language',
defining the latter as a language in which each syllable is assigned
a distinctive tone. With such a rigorous definition, the Scandinavian
languages do not qualify. Nevertheless the 'accents' of Swedish and
Norwegian have an inner distinction that is directly tied to pitch. The
same question therefore comes up again, as with Chinese: how are
tone and intonation adjusted to each other? In the fourth Reading,
Haugen and Joos describe the differences in pitch that set one accent
off from the other in East Norwegian, and the differences of placement
that affect the domains of tone and intonation.


22 Ilse Lehiste and Gordon E. Peterson


Some Basic Considerations in the Analysis of Intonation


Ilse Lehiste and Gordon E. Peterson, 'Some basic considerations in the analysis
of intonation', Journal of the Acoustical Society of America, vol. 33, no. 4, April
1961, pp. 419-25.


Introduction

In many languages, the fundamental frequency of the voice has a distinctive
function. In so-called 'tone languages', pitch level or movement may
contribute to lexical and morphological distinctions. In languages where pitch
has no such function, levels or contours of pitch may in part determine the
meaning of the message in which the contour appears. Some analyses of
English thus postulate suprasegmental morphemes, consisting of pitches and
terminal junctures, with differential meaning (Trager and Smith, 1957,
pp. 65-77). According to two widely accepted analyses, American
English has a system of four intonation levels (Pike, 1945; Wells, 1945).
Both of these systems were formulated without the benefit of instrumental
analysis. The present paper represents an attempt to analyse acoustically
an intonation contour in American English, and to determine some of the
factors that influence the phonetic realization of the intonation contour.


Material and method

The material analysed consists of two sets of utterances, a large set of 1263
utterances by one speaker, and a smaller set of 350 utterances by five
speakers (see Peterson and Lehiste, 1960, for further details on this corpus).
It is assumed that a reasonable correspondence between the two sets of
data indicates that the larger set may be considered representative, even
though every item included in the larger set was not actually compared
with data from several different speakers. The sets consist of the frame
'Say the word ... again.' Primary stress and a change from highest to
lowest intonation level occurred on the word that was in the commutation
position. The speaker for the large corpus used 1263 CNC words¹ with
the frame; the five speakers of the control group all read an identical set of
seventy words in the same frame. All speakers used approximately the
same stress and pitch pattern.

Various acoustic analyses were made of the recorded data; the measurements
most relevant for the present paper were from four-inch narrow-band
spectrograms. The fundamental frequency could be determined from
these spectrograms with an accuracy of approximately ±1 Hz.


1. [...] words consisting of an initial con[sonant, ...] nuclei of American English, and a
[...]es and the syllable nuclei may [...] described in more detail by Lehiste [...].


Intrinsic fundamental frequency

The average fundamental frequency that was associated with each stressed
syllable nucleus was computed for both sets of data. The results appear in
Table 1 and Figure 1. The fundamental frequency measures for the speaker
of the CNC list are appreciably higher than those for the male speakers of
the Peterson and Barney data. This is probably the result of the fact that
the measurements presented in Figure 1 were taken at the peak of the
intonation contour, and that the speaker employed a relatively wide range
[...]. A different intonation pattern was [...] in the course of the Peterson-
[...] both Table 1 and Figure 1 that [...] frequency is associated with
[...], in which /i/ and /u/ are associa[...] fundamental voice frequencies,
[...] /ə/ and [...] occur approximate[...]. With the diphthongs, the peak of the
[...] on the first [...] fundamental frequency associated with it was
similar to that occurring on /ɑ/ and /ɔ/. This is not a new observation;
reports have appeared previously in the literature describing similar findings
(House and Fairbanks, 1953). In the present set of data, however, the
intrinsic fundamental frequency is related to a specific intonation contour
in spoken American English. If a system of several levels is postulated, it
appears significant that the same level is habitually associated with a variety
of fundamental frequency values, varying in a manner which is influenced
by the phonetic quality of the vowel. In linguistic terms, the selection of a
particular pitch allophone is conditioned by the segmental quality of the
syllable nucleus.


Table 1 Average fundamental frequency (Hz)
associated with syllable nuclei

SN     Average for      GEP     Peterson-
       five speakers            Barney
i      129              183     136
ɪ      130              173     135
eɪ     130              169
ɛ      127              166     130
æ      125              162     127
ə      127              164     130
ɑ      120              163     124
ɔ      116              165     129
oʊ     122              170
ʊ      133              171     137
u      134              182     141
aʊ     119              159
aɪ     124              160
ɔɪ     123              163
ɝ      130              170     133


Figure 1 The points connected by the solid lines represent the average
fundamental frequencies, arranged according to syllable nuclei, that
occurred at the peak of the intonation contour in the 1263 CNC words uttered
by GEP; the points connected by the dashed lines represent average values
from Peterson and Barney (1952)


2. The fundamental frequency was derived by measuring the center frequency of
selected higher harmonics on a four-inch narrow-band spectrogram; the measured
frequency was divided by the order number of the respective harmonic to obtain the
fundamental frequency. Usually, both the tenth and the twentieth harmonics were
measured. On these spectrograms [...] harmonics are appreciably narro[...]
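The harmonic-division procedure described in note 2 can be sketched in a few lines of Python. This is only an illustration: the function name and the two harmonic readings are hypothetical, and averaging the two estimates is an assumption, since the note says only that both harmonics were usually measured.

```python
def fundamental_from_harmonics(measurements):
    """Estimate the fundamental frequency in Hz.

    `measurements` is a list of (harmonic_number, centre_frequency_hz)
    pairs read off a narrow-band spectrogram. Each reading is divided by
    its harmonic number; the estimates are then averaged (an assumption
    of this sketch, not a step stated in the source).
    """
    estimates = [freq / number for number, freq in measurements]
    return sum(estimates) / len(estimates)


# Hypothetical readings: 10th harmonic at 1690 Hz, 20th at 3380 Hz.
f0 = fundamental_from_harmonics([(10, 1690.0), (20, 3380.0)])
print(f0)
```

Dividing by the harmonic number also divides the reading error: a 10 Hz misreading of the tenth harmonic shifts the estimate by only 1 Hz, which is in line with the accuracy of about ±1 Hz quoted in the text.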


Initial and final consonants


Each syllable nucleus thus appears to be associated with a specific average
fundamental frequency. Within each set of utterances containing the same
stressed syllable nucleus, further regular variations were observed. The
larger set of utterances contained several samples of the various vowel-
consonant combinations, so that it was possible to study the influence of
each initial and final consonant upon the fundamental frequency associated
with each syllable nucleus. The smaller set contained only from three to
five occurrences of each of fifteen syllable nuclei, and therefore it was not
possible to compare the results, although the same general effects were
observed in this limited set.

Table 2 and Figure 2 represent the influence of initial consonants upon
the fundamental frequency associated with syllable nuclei. The table
presents the average fundamental frequency for all occurrences of the
syllable nucleus preceded by each of the consonants listed.³ Figure 2
presents two curves, showing the fundamental frequency of two vowels /i/
and /æ/ as a function of the initial consonant; these two vowels have the
highest and the lowest intrinsic fundamental frequencies respectively in this
set of data. The straight lines in Figure 2 represent the average fundamental
frequencies for /i/ and /æ/, computed for 105 occurrences of /i/ and
131 occurrences for /æ/. In general, higher fundamental frequencies occur
after a voiceless consonant and considerably lower fundamental frequencies
occur after a voiced consonant. This distinction is accompanied
by a different distribution of the fundamental frequency movement
over the test word: after a voiceless consonant, and particularly after a
voiceless fricative, the highest peak occurs immediately after the consonant;
whereas after a voiced consonant, especially a voiced resonant, the fundamental
frequency rises slowly, and the peak occurs approximately in the
middle of the test word.⁴


[Figure 2: fundamental frequency in Hz for /i/ and /æ/ as a function of the
initial consonant]


Table 2 Fundamental frequency at the peak of the intonation contour,
in Hz, as influenced by the initial consonant of the test word

[The rows for the individual preceding consonants are not legible; only the
row of average peak values is recoverable.]

Syllable nucleus   i    ɪ    eɪ   ɛ    æ    ə    ɑ    ɔ    oʊ   ʊ    u    aʊ   aɪ   ɔɪ   ɝ
Average peak       183  173  169  166  162  164  163  165  170  171  182  159  160  163  170
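The size of the voicing effect can be checked against the averages that the authors report for initial consonants in their Table 4. The sketch below uses four voiceless/voiced pairs from that table; the choice of pairs and the function name are illustrative assumptions, not the authors' own grouping.

```python
from statistics import mean

# Average peak F0 (Hz) over all vowels after each initial consonant,
# as reported in Table 4. The selection of pairs is illustrative only.
VOICELESS_INITIAL = {"p": 175, "t": 176, "k": 176, "s": 175}
VOICED_INITIAL = {"b": 165, "d": 163, "g": 163, "z": 169}


def voicing_gap(voiceless, voiced):
    """Return mean F0 after voiceless consonants minus mean F0 after voiced ones."""
    return mean(voiceless.values()) - mean(voiced.values())


gap = voicing_gap(VOICELESS_INITIAL, VOICED_INITIAL)
print(gap)
```

For these four pairs the peak after a voiceless initial consonant is on average about 10 Hz higher than after its voiced counterpart.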

In these data, the final consonants have no such regular influence on the
preceding syllable nuclei. Table 3 presents average values associated with
each final consonant and syllable nucleus; Figure 3 shows the same
material graphically for the two vowels /i/ and /æ/. As in Figure 2, the
straight lines represent the average fundamental frequencies for /i/ and
/æ/, computed for 105 occurrences of /i/ and 131 occurrences for /æ/. It
appears that the distance of the points from the straight lines is related
to the number of occurrences of a particular syllable-nucleus, final-consonant
sequence. The greater the number of occurrences, the closer is the
value for a particular final consonant to the average value, and thus the
smaller the distance of the point from the line representing the average
value. The position of points representing single occurrences of a particular
sequence may be influenced by the initial consonant, and may fall
anywhere within the limits of fluctuation possible for a given syllable
nucleus. It seems probable that in English the voiceless-voiced contrast of
the final consonant has no significant influence on the fundamental
frequency appearing on a preceding syllable nucleus. The two instances of
greatest divergence from the average are the values associated with the
sequences /ig/ and /idʒ/. Both points represent only one word each, league
and liege, and it is likely that the initial /l/ has caused the low fundamental
frequency on both words.


3. The number of occurrences from which the [...] may be found from the distribution
[...] of initial [...] and syllable nuclei in the CNC words. Lehiste and Peterson (1959b).
4. This effect has been described in more detail elsewhere (Peterson and Lehiste,
1960).


Table 3 Fundamental frequency at the peak of the intonation contour,
in Hz, as influenced by the final consonant of the test word

[The rows for the individual following consonants are not legible; only the
row of average peak values is recoverable.]

Syllable nucleus   i    ɪ    eɪ   ɛ    æ    ə    ɑ    ɔ    oʊ   ʊ    u    aʊ   aɪ   ɔɪ   ɝ
Average peak       183  173  169  166  162  164  163  165  170  171  182  159  160  163  170


Figure 3 The fundamental frequencies for /i/ (dashed curve) and /æ/ (solid
curve) for GEP as a function of the final consonant of the CNC sequence.
The top straight line shows the average for all occurrences of /i/; the bottom
straight line presents the average value of /æ/.


Table 4 and Figure 4 present the combined data for GEP for all initial
and final consonants and all syllable nuclei. The solid curve presents the
average values for all syllable nuclei associated with an initial consonant
and the dashed curve shows the [...] associated with the syllable nucleus,
[...] are associated with a higher fundamental
frequency than are voiced initial consonants.


Table 4 Influence of initial and final consonants on the fundamental
voice frequency at the peak of the intonation contour in CNC words

Consonant    Average for all         Average for all
             vowels after initial    vowels before final
             consonant               consonant
p            175                     174
b            165                     166
t            176                     168
d            163                     168
k            176                     170
g            163                     164
m            162                     168
n            161                     167
ŋ                                    165
f            173                     169
v            155                     169
θ            173                     170
ð            161                     171
s            175                     169
z            169                     171
ʃ            173                     163
ʒ                                    165
r            166                     168
l            164                     169
tʃ           177                     174
dʒ           161                     168
h            174
w            167
hw           174
j            164
Average      169                     169


Test-word intonation contours

The foregoing analysis has shown th[at ...] nucleus is one of the factors
that de[...] nucleus. If the lower intonation level shows fluctuations similar
to those of the peak, the further implication should be considered that the
movement from one intonation level to the next may involve a fixed ratio of
frequencies, possibly corresponding to some musical interval. If this holds
true, it is necessary to investigate whether different speakers use the same
intervals when producing the same intonation contour.

The fundamental frequencies on the final part of the test word, as well as
the fundamental frequencies associated with the precontour 'Say the word'
and with the final word in the sentence, 'again', were measured for both
sets of data. Table 5 presents these data for GEP, and Table 6 for the five


Figure 4 Average peak values of all fifteen syllable nuclei as functions of
initial and final consonants. The averages were computed from the 1263 CNC
words recorded by GEP. The solid curve represents the influence of the
initial consonant on the combined average of the syllable nuclei, the dashed
curve the influence of the final consonant. The straight line represents the
average for all syllable nuclei in the total set


[...] that occurred on the unstressed first [...] stressed second syllable,
and at the end [...] only negligible fluctuations. This indicates [...] are
involved. As may be seen from [...]

Table 5 Average fundamental frequencies at specified points within
the sentences uttered by GEP

SN        Number of      Fundamental frequency (Hz)
          occurrences    Precontour    Test word       End of frame
                         ('word')                      ('again')
                         End           Peak    End     Beg.    Peak    End
i         105            129           183     94      115     132     87
ɪ         141            126           173     98      114     131     87
eɪ        119            130           169     98      116     131     91
ɛ         94             129           166     95      115     132     88
æ         131            126           162     92      112     130     87
ə         109            124           164     98      111     128     85
ɑ         75             127           163     93      113     128     88
ɔ         79             125           165     92      111     130     84
oʊ        93             126           170     93      113     130     84
ʊ         28             125           171     90      109     127     81
u         74             127           182     94      113     133     87
aʊ        35             127           159     93      113     127     86
aɪ        93             125           160     91      111     129     85
ɔɪ        16             128           163     93      111     129     84
ɝ         71             126           170     94      103     131     85
Average                  127           169     94      113     130     86
Total     1263


major third and a pure fourth, whereas the speaker with the greatest voice
inflection used a downward movement in frequency approximately
equivalent to a major seventh. It may perhaps be concluded that the actual
interval range is irrelevant in this intonation contour.

Table 7 also contains statistical information about the percentage of
musically 'pure' intervals⁵ used by the different speakers. The calculation
is based on comparing the frequency ratios used by the different speakers
with the ratios of successive harmonics of a complex tone. Several factors
make this part of the table tentative. The accuracy of measurement is
approximately ±1 Hz. This limitation of measurement accuracy affects
the ratio in a different manner, depending on the ranges in which the
measurements are taken. Little is known, however, about the fluctuation

5. For the purposes of this study, a musically 'pure' interval was defined as the
difference between two fundamental frequencies that can be expressed as a simple
numerical ratio: 2/1 for an octave, 3/2 for a pure fifth, 4/3 for a pure fourth, 5/4 for
a major third, 6/5 for a minor third, etc. We considered an intonation pattern to
represent a 'pure' interval when the ratio between the two frequency values did not
differ from that of a 'pure' interval by more than 1/100.
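The criterion in note 5 can be written out as a short Python sketch. The interval list covers only the ratios the note names (its 'etc.' is not expanded), the function name is hypothetical, and the tolerance of 1/100 is read here as an absolute difference between ratios, which is an assumption; the note could also be read as a relative difference.

```python
# Simple-ratio intervals named in note 5 (the note's "etc." is not expanded here).
PURE_INTERVALS = {
    "octave": 2 / 1,
    "pure fifth": 3 / 2,
    "pure fourth": 4 / 3,
    "major third": 5 / 4,
    "minor third": 6 / 5,
}


def classify_pure_interval(f_high, f_low, tolerance=0.01):
    """Return the name of the 'pure' interval matched by f_high/f_low, or None.

    A match requires the frequency ratio to lie within `tolerance` of the
    simple ratio (absolute difference: an assumption of this sketch).
    """
    ratio = f_high / f_low
    for name, pure_ratio in PURE_INTERVALS.items():
        if abs(ratio - pure_ratio) <= tolerance:
            return name
    return None


print(classify_pure_interval(150, 100))  # a ratio of exactly 3/2
print(classify_pure_interval(126, 99))   # speaker Br's average ratio in Table 7
```

Under this reading, an average ratio such as 126/99 falls between a major third and a pure fourth and matches neither, which agrees with the range given for that speaker in Table 7.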




Table 6 Average fundamental frequencies at specified points within
the sentences uttered by five speakers

SN        Number of      Fundamental frequency (Hz)
          occurrences    Precontour    Test word       End of frame
                         ('word')                      ('again')
                         End           Peak    End     Beg.    Peak    End
i         25             102           129     90      92      100     77
ɪ         20             104           130     91      93      99      78
eɪ        20             102           130     87      92      97      78
ɛ         20             105           127     91      95      101     81
æ         20             103           125     8?      92      98      78
ə         20             105           127     90      92      99      77
ɑ         25             104           120     89      90      100     80
ɔ         20             100           116     83      90      96      77
oʊ        20             104           122     88      90      96      78
ʊ         15             101           133     90      96      97      78
u         20             103           134     88      93      97      77
aʊ        20             103           119     84      91      98      79
aɪ        20             101           124     85      89      96      79
ɔɪ        15             103           123     88      89      96      77
ɝ         20             104           130     86      92      96      78
Average                  103           126     88      92      98      78
Total     300


Table 7 Fundamental frequency ratios on the test words
expressed as musical intervals

Speaker            Average frequency    Corresponding    Percentage of 'pure'
                   ratio on test word   musical          intervals
                                        interval         (3, 4, 5, 6, 8)
Bi                 136/83    1.64       m6-M6            25
Br                 126/99    1.27       M3-P4            27
Ch                 120/82    1.46       D5-P5            22
He                 136/97    1.40       P4-D5            30
Re                 113/78    1.45       D5-P5            32
GEP (total set)    169/94    1.79       m7-M7            14
GEP /i/            183/94    1.95       M7-P8            14
GEP /æ/            162/92    1.76       m7-M7            4
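The interval labels in Table 7 can be cross-checked by converting each frequency ratio to a size in semitones. The sketch below uses the equal-tempered semitone (12 times the base-2 logarithm of the ratio), which is an assumption of this example: the authors compare against simple harmonic ratios, not tempered ones. The function name is hypothetical; the frequency pairs are taken from the table.

```python
import math


def ratio_to_semitones(f_high, f_low):
    """Size of the interval f_high/f_low in equal-tempered semitones."""
    return 12 * math.log2(f_high / f_low)


# Average peak and end frequencies (Hz) on the test word, from Table 7.
SPEAKER_RATIOS = {
    "Bi": (136, 83),
    "Br": (126, 99),
    "GEP (total set)": (169, 94),
}

for speaker, (high, low) in SPEAKER_RATIOS.items():
    print(speaker, round(high / low, 2), round(ratio_to_semitones(high, low), 1))
```

On this scale a major third is 4 semitones, a pure fourth 5, a minor seventh 10 and a major seventh 11, so speaker Br's fall lies between a major third and a pure fourth, and the fall for GEP (total set) lies between a minor and a major seventh, as the table states.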


in fundamental frequency within which a listener may identify a pitch
movement with a specific musical interval, particularly when this interval
occurs in speech. It is at least a possibility that the fluctuation in
fundamental frequency due to vowel quality has no corresponding effect on the
perception of the pitch interval.


Word and frame contours 


In the spectrographic analysis, the test word and the final word of the
frame, 'again', were always included. This often did not leave room for the
complete frame preceding the test word, but a selected set of analyses
showed that the contour preceding the frame was approximately level for
the various informants. Thus, only the average fundamental frequency on
the part of the precontour immediately preceding the test word (i.e. on
'word') appears in column 3 of Tables 5 and 6. Both perceptually and
physically (in terms of Hz) the precontour appears to form a middle
intonation level compared to the highest and lowest levels that were
observed on the test word. Since the segmental structure of the part of the
utterance on which this middle intonation level occurred remained identical
for all utterances, the data provide some information about the range of
fluctuations within one phonemic intonation level, unconditioned by
differences in phonetic quality.


Table 8 shows that the syllable nucleus of the following test word has
no [...] influence on the fundamental frequency used on the last word
of the precontour. The table presents the number of instances in which the
fundamental frequency on 'word' fell within a particular frequency range
preceding test words containing the syllable nuclei /i/ and /ɑ/. The frequency
ranges are approximately the same for the fundamental frequency
on 'word' preceding 105 occurrences of test words containing /i/ and 75
occurrences of test words containing /ɑ/ as syllable nucleus. Since the
intrinsic fundamental frequency on /i/ is appreciably higher than that on
/ɑ/, the rise in frequency from the precontour to the peak of the intonation
contour is correspondingly different. The fundamental frequency for
words with /i/ rises approximately 55 Hz from the end of the precontour,
but only 36 Hz from the end of the precontour for /ɑ/. Table 9 shows the
number of instances in which the rise in fundamental frequency from the
end of the precontour to the peak of the test word fell within a particular
range. Since the values on the precontour remained relatively constant, a
greater rise was associated with /i/ than /ɑ/.


Table 8 Fundamental frequency ranges for 'word' in utterances
preceding syllable nuclei /i/ and /ɑ/

Fundamental frequency      Number of occurrences of
ranges for 'word' (Hz)     'word' preceding SN
                           /i/      /ɑ/
106-110                    [...]
111-115                    2        6
116-120                    17       12
121-125                    33       20
126-130                    37       16
131-135                    7        1
136-140                    [...]
141-145                    [...]


Table 9 Ranges of the rise of fundamental frequency from the end of
the precontour to the peak occurring on syllable nuclei /i/ and /ɑ/

Difference between         Number of occurrences
precontour and peak        of SN
of SN in Hz                /i/      /ɑ/
11-15                               2
16-20                               1
21-25                               8
26-30                      [...]
31-35                      2        20
36-40                      8        16
41-45                      [...]
46-50                      20       5
51-55                      20       1
56-60                      17       1
61-65                      18
66-70                      3
71-75                      3
76-80                      1


The actual value reached by the syllable nucleus depends partly on the
initial consonant, as has been shown. Considerable overlap between the
ranges of fundamental frequency for the different vowels may be expected;
for example, the fundamental frequency of a vowel with high intrinsic
fundamental frequency occurring in a word beginning with a consonant
that has a lowering influence may overlap that of a vowel with a low
intrinsic fundamental frequency preceded by a consonant that has a raising
influence. Table 10 presents both the ranges for /i/ and /a/ and the area of
overlap between them.


Table 10 Fundamental frequency ranges for test words
containing syllable nuclei /i/ and /a/

Fundamental frequency     Number of occurrences
ranges in Hz              of SN
                          /i/            /a/

136-140                   [illegible]
141-145                                  2
146-150                                  5
151-155                   [illegible]
156-160                   [illegible]
161-165                                  19
166-170                   [illegible]
171-175                   10             7
176-180                   20             5
181-185                   [illegible]
186-190                   [illegible]
191-195                   24
196-200                   6
201-205                   4
206-210                   [illegible]
211-215                   [illegible]


Finally, the fundamental frequency occurring on a syllable nucleus
may vary over a certain range in successive repetitions of the same word.
Table 11 shows the percentage of instances in which the fundamental
frequency on the 1263 occurrences of ‘word’ at the end of the precontour fell
within a specified frequency range. Approximately 75 per cent of all
occurrences were between 116 and 130 Hz, but the total range was from
105 to 145 Hz.


Table 11 Percentages of instances in which the fundamental frequencies
associated with the 1263 utterances of ‘word’ fell within various ranges

Fundamental frequency     Percentage of
ranges in Hz              occurrences

105-110                   2.6
111-115                   4.2
116-120                   21.2
121-125                   25.1
126-130                   30.5
131-135                   9.2
136-140                   6.6
141-145                   [illegible]
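As a quick arithmetic check on the figure of ‘approximately 75 per cent’, the percentages of Table 11 can be summed over the ranges between 116 and 130 Hz. The sketch below is only illustrative: the values are those legible in the table, their pairing with frequency ranges is an assumption, and the names `TABLE_11` and `share_between` are hypothetical.

```python
# Percentages of the 1263 occurrences of 'word', keyed by frequency range in Hz.
# Assumption: values are paired with ranges as read from Table 11; the entry
# for 141-145 Hz is not legible and is therefore left out.
TABLE_11 = {
    (105, 110): 2.6,
    (111, 115): 4.2,
    (116, 120): 21.2,
    (121, 125): 25.1,
    (126, 130): 30.5,
    (131, 135): 9.2,
    (136, 140): 6.6,
}


def share_between(table, low_hz, high_hz):
    """Sum the percentages of all ranges lying wholly inside [low_hz, high_hz]."""
    return sum(pct for (lo, hi), pct in table.items()
               if lo >= low_hz and hi <= high_hz)


print(round(share_between(TABLE_11, 116, 130), 1))
```

On these readings the three central ranges account for a little over three quarters of the occurrences, which agrees with the approximate figure given in the text.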

The contour applied to the last word in the frame is, in a sense, a smaller-
scale repetition of the sequence of three levels that appeared on ‘Say the
word . . .’ Here, too, we found three levels; from the point of view of the
item ‘again’ alone, these might be described as a sequence of middle, high
and low intonation levels. However, the actual values that appeared as a
manifestation of these three levels differed considerably from those
appearing on the first part of the contour. For GEP, the high level on ‘again’
was consistently slightly higher than the middle level of the first part, but
was very considerably lower than the high of the first part of the contour
(‘Say the word . . .’). For the five speakers of the smaller set, the high of
the sequence of intonation levels on ‘again’ was lower than the middle
of the first part of the contour. The low of ‘again’ was noticeably lower
than the low that occurred on the test word. The drop from high to low in
the contour on ‘again’ was always smaller than the comparable drop on
the test word; expressed in musical intervals, the average drop was
approximately equal to a pure fifth (v. a major seventh) for GEP and equal to a
major third (v. a diminished fifth) for the five speakers. The physical data
do not suggest any immediate technique for identifying the levels as they
appeared on the word ‘again’ with any of the levels that occurred on
the first part of the contour preceding the word ‘again’.
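The comparison in musical intervals rests on the usual conversion of a frequency ratio into semitones, 12 times the base-2 logarithm of the ratio. The following is a minimal sketch of that conversion, not the authors' procedure; the function names and the example frequencies are hypothetical, and the interval sizes are the equal-tempered semitone counts.

```python
import math

# Equal-tempered sizes (in semitones) of the intervals named in the text.
INTERVALS = {
    4: "major third",
    6: "diminished fifth",
    7: "perfect fifth",
    11: "major seventh",
}


def semitones(f_high_hz, f_low_hz):
    """Size of the interval between two frequencies, in semitones."""
    return 12 * math.log2(f_high_hz / f_low_hz)


def nearest_interval(f_high_hz, f_low_hz):
    """Name of the listed interval closest in size to the given frequency drop."""
    size = semitones(f_high_hz, f_low_hz)
    return INTERVALS[min(INTERVALS, key=lambda s: abs(s - size))]


# Hypothetical example: a drop from 150 Hz to 100 Hz is a ratio of 3:2.
print(round(semitones(150, 100), 2), nearest_interval(150, 100))
```

Because the conversion depends only on the ratio, the same drop in Hz corresponds to a larger interval when it starts from a lower frequency.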

We considered the hypothesis that the pitch peak on the word ‘again’
might be conditioned by the presence of secondary stress at the beginning
of the second syllable, and that the intonation pattern on ‘again’ might
thus involve only a sequence of middle intonation level followed by low
intonation level. In the case of GEP this appears plausible, as the average
value of the fundamental frequency of the peak that appeared on ‘again’
was slightly higher than that occurring on the precontour. In the case of
the five speakers, however, this hypothesis appears untenable. The peak
on ‘again’ is lower than the level used on the precontour; there were
actually a considerable number of instances where the fundamental
frequency on the word ‘again’ was consistently falling, so that the frequency
[...]. From [...] 6, it may be seen that the average differences
between the values that were measured on the unstressed and stressed
syllables may differ by as little as 1 Hz (when ‘again’ followed words with
the syllable nucleus /u/), with a maximum average difference of 10 Hz
[...]. The [...] difference between the averages on unstressed and
stressed syllables in ‘again’ is approximately 6 Hz, which may be
compared with the differences that occurred on different repetitions
of the same word in the precontour, where the differences between the
[...] amounted to a maximum of 5 Hz. In all instances, however,
[...] made it possible to identify the stress on the second
syllable of ‘again’. It appears from the analysis of this part of the
intonation contour that differences in stress are not necessarily represented by
conditioned differences in the phonetic realization of intonation levels.


Conclusion 


The investigation reported in this paper indicates a number of problems
involved in the instrumental analysis of intonation. A linguistically
significant intonation level may have a wide range of phonetic manifestations.
Some of the conditions that influence the selection of a particular pitch allophone
have been described. It appears that the phonetic quality of the syllabic
sound has an influence on the fundamental frequency at which the
intonation level is produced. Further, the initial consonant in a consonant-
vowel sequence may influence the fundamental frequency appearing on the
vowel following the consonant. The variations in fundamental frequency,
however, that may occur when the same intonation level is repeatedly
produced on the same word, may be greater than the variations associated
with changes in segmental quality; the differences can only be established
when a sufficient number of utterances are compared. The influence of stress
upon the manifestation of a particular intonation level needs to be explored
more fully; the data reported here suggest that, at least in some instances,
a lower fundamental frequency may occur on a stressed syllable than on a
preceding unstressed syllable. The problem of relating contourlike
movements to musical intervals seems to be less relevant for a study of English
than for a study of tone languages; it appears from our data that the
intonation contours of American English are not based on recurring
musical intervals. Most of the data presented illustrate the realization of
a single intonation level occurring under sentence-maximum stress; the
question of contrastive intonation levels and their relation to contrastive
degrees of stress remains to be considered. The instrumental analysis of
intonation emerges as a problem of great complexity.


References

HOUSE, A. S., and FAIRBANKS, G. (1953), ‘The influence of consonant environment
upon the secondary acoustical characteristics of vowels’, J. Acoust. Soc. Amer.,
vol. 25, pp. 105-13.
LEHISTE, I., and PETERSON, G. E. (1959a), ‘Linguistic considerations in the study of
speech intelligibility’, J. Acoust. Soc. Amer., vol. 31, pp. 280-86.
LEHISTE, I., and PETERSON, G. E. ([...]), [...] Laboratory Report No. 3,
University of Michigan.
PETERSON, G. E., and BARNEY, H. L. (1952), ‘Control methods used in a study of
the vowels’, J. Acoust. Soc. Amer., vol. 24, pp. 175-84.
PETERSON, G. E., and LEHISTE, I. (1960), ‘Duration of syllable nuclei in English’,
J. Acoust. Soc. Amer., vol. 32, pp. 693-703.
PIKE, K. L. (1945), The Intonation of American English, University of Michigan
Press.
TRAGER, G. L., and SMITH, H. L., Jr (1957), An Outline of English Structure,
Studies in Linguistics: Occasional papers no. 3, American Council of Learned
Societies, Washington D.C.
WELLS, R. S. (1945), ‘The pitch phonemes of English’, Language, vol. 21, pp. 21-39.


23 Werner Meyer-Eppler 


Realization of Prosodic Features in Whispered Speech


Werner Meyer-Eppler, ‘Realization of prosodic features in whispered speech’,
Journal of the Acoustical Society of America, vol. 29, no. 1, 1957, pp. 104-6.


Author's summary 

Experiments utilizing a visible-speech analyser showed that changes of
pitch in normal (voiced) speech are replaced in whispered speech by
shifts of some formant regions accompanied by added noise between
the higher formants.


Introduction 

It is a well-known fact that people can be understood without any difficulty
when they are whispering instead of speaking normally. This fact is not very
strange as the formant frequencies of the vowels and the envelopes and
[...] fricative and plosive sounds are considered to be the
information-carrying elements of speech. It must be doubtful, however, whether
full information can be carried by whispered speech in tonal languages like
Chinese or many West-African languages where pitch is used to
differentiate the meaning of various lexical items consisting of otherwise identical
sequences of phons.

Panconcelli-Calzia (1955) and Giet (1950) have dealt with the
problem of whispering in tone languages. Whereas Panconcelli-Calzia
[...] that it was difficult for Chinese-born subjects to understand
whispered Chinese, Giet, who had lived in China for many years as a missionary,
[...] whispering in Chinese is as effective a means of verbal
communication as normally spoken speech. According to Giet's arguments
(1956) there must exist some substitute for the missing pitch quality in
whispered speech within the acoustical range. As Giet has already pointed
out, there is no need to use tone languages for investigating substitutes in
whispered speech. Similar results would be achieved by using any language
where pitch belongs not to the phonemic but to the prosodic level,
where the term refers to features belonging to a sentence as a whole that are
expressed by pitch and stress patterns. Intonation e.g. may differentiate
between a question and a statement.


Vowels ‘sung’ without voice


Some orienting investigations were undertaken with German vowels
‘sung’ without voice. It is not difficult to produce the same whispered
vowel on different pitch levels within a range of about a musical fifth
(i.e. a frequency ratio of 2:3). Obviously this can only be done by changing
the spectral structure of the vowels within the limits of recognizability.
The subjects were asked to ‘sing’ the first five tones of a diatonic scale
(e.g. c, d, e, f and g) maintaining the quality of a given vowel as well as
possible. The sounds were recorded on magnetic tape and analysed by
means of a visible-speech analyser (Sona-Graph). The spectrograms of a
test series using the German vowels [a] (as in Tal), [e] (as in See), [i] (as in
viel), [o] (as in Sohn), and [u] (as in Schuh) are shown in Figure 1. Whereas
in the case of [a], [e], [i] and [o] the position of the first two formants
remains unchanged, the third formant of [a] is shifted from its position near
2.5 kc to about 3 kc if higher pitch is intended; a similar shift is found at a
weak fifth formant near 5 kc. In the case of [u] the main formant itself is
raised from 600 Hz to 700 Hz. This can be seen more easily in Figure 2,
where an enlarged frequency scale together with better spectral resolution
is used. The spectrogram of Figure 2 was achieved by playing back the
magnetic tape upon which the vowels had been recorded at a higher than
normal speed. The higher formants of [u], however, remain unaffected
(Figure 1). Since in the case of [e], [i] and [o] no very clear shift of formant
positions can be observed, the apparent change of pitch must be caused by
other spectral properties. Pike had already supposed that differences in
intensity might serve as substitute for pitch (Pike, 1948, p. 34), and his
expectation is confirmed by the spectrograms of Figure 1. Raising the
‘pitch’ of [e], [i] and [o] means increasing their intensity, thus filling the
gaps in the higher spectral regions with noisy components and eventually
broadening the formants above 2 kc to a less-sharply profiled, fricative-
like spectrum. The same happens with [a] and [u] in addition to the shift
of their formants. Observation of the ‘singing’ subjects reveals their larynx
to be raised at the ‘higher’ vowels, indicating a narrowing of the glottal
fissure.


Figure 1 German vowels, whispered in a diatonic scale (vertical axis: frequency in kc)


Figure 2 Three vowels of Figure 1 with enlarged frequency scale and improved
spectral resolution (vertical axis: frequency in kc)
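The interval arithmetic behind the five-tone scale, and the effect of speeding up the tape for Figure 2, can be sketched as follows. This is only an illustration under stated assumptions: the scale is taken to be a major scale in just intonation, the base frequency and the speed factor of 2 are hypothetical (the text says only ‘higher than normal speed’), and the function names are invented for the example. A whispered vowel has no fundamental, so the ‘frequencies’ here are nominal targets, not measured values.

```python
from fractions import Fraction

# Just-intonation ratios of the first five tones of a major scale (assumed).
SCALE = {
    "c": Fraction(1),
    "d": Fraction(9, 8),
    "e": Fraction(5, 4),
    "f": Fraction(4, 3),
    "g": Fraction(3, 2),  # the fifth: a frequency ratio of 2:3
}


def scale_frequencies(base_hz):
    """Nominal frequency of each scale tone above a chosen base frequency."""
    return {note: float(base_hz * ratio) for note, ratio in SCALE.items()}


def playback_frequency(recorded_hz, speed_factor):
    """Frequency at which a recorded component appears when the tape is
    played back speed_factor times faster than it was recorded."""
    return recorded_hz * speed_factor


print(scale_frequencies(200))
print(playback_frequency(600, 2), playback_frequency(700, 2))
```

With a hypothetical speed factor of 2, the 600 Hz and 700 Hz positions of the main formant of [u] would appear 200 Hz apart instead of 100 Hz, which is the sense in which faster playback enlarges the frequency scale.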


Analysis of spoken sentences


It might seem that singing without voice is a rather unnatural process, and
that results obtained in this case need not necessarily be applicable to
spoken words or sentences. Visible-speech diagrams of whispered words,
however, show that the same effects as with sung vowels occur. Since [u]
is an exceptionally good vowel for investigating the influence of intonation,
sentences like ‘Das ist aber nicht gut’ and ‘Ist das etwa nicht gut?’ were
whispered by different speakers and analysed. Figure 3 gives an example of
the word ‘gut’ spoken with the level tone (a) and with rising tone (b). In the
latter case the shift of the formant of [u] towards the likewise raised
formant region of [t] is very impressive.

A new phenomenon occurs in the words [-zaen] and ['zaen], taken from
sentences like ‘Das sollst du sein!’ and ‘Wer soll das sein?’ which are shown
in Figure 4. The interrogative intonation causes a new formant to originate
at 2 kc belonging to the considerably reinforced [n], whereas the diphthong
[ae] shows no clear differences.

Figure 5 was chosen to give an impression of the reliability of our
conclusions concerning the shift of formant positions. The same pair of words
having different intonations (‘ja?’ and ‘ja!’), as spoken by two male
subjects, shows, despite the unequal length of the individual vowels, the same
type of evolution of the third formant.


Figure 3 Examples of different intonation of the word ‘gut’ (vertical axis: frequency in kc)


Figure 4 The whispered words ‘... sein!’ and ‘... sein?’ (vertical axis: frequency in kc)


Figure 5 The words ‘ja?’ and ‘ja!’ whispered by two male subjects (vertical axis: frequency in kc)


Summary

Spectrographic analysis of whispered vowels and words shows that there
exist two substitutes for pitch movements which in voiced speech are used
to indicate different prosodic features. The whispered vowels [e], [i] and [o]
substitute spectral noise for pitch, whereas [a] and [u] possess some
formants whose position changes with the intended ‘pitch’.


References

GIET, F. (1950), Zur Tonität nordchinesischer Mundarten, Verlag der
Missionsdruckerei St Gabriel, Vienna.
GIET, F. (1956), ‘Kann man in einer Tonsprache flüstern?’, Lingua, vol. 5, pp. 372-81.
PANCONCELLI-CALZIA, G. (1955), ‘Das Flüstern in seiner physio-pathologischen
Bedeutung’, Lingua, vol. 4, pp. 369-78.
PIKE, K. L. (1948), Tone Languages, University of Michigan Press.


24 Nien-Chuang T. Chang


Tones and Intonation in the Chengtu Dialect
(Szechuan, China)¹


Nien-Chuang T. Chang, ‘Tones and intonation in the Chengtu dialect (Szechuan,
China)’, Phonetica, vol. 2, nos. 1/2, 1958, pp. 59-84.


Notes on the transcriptions

p, t, k, ts, tʃ = aspirated
b, d, g, dz, dʒ = unaspirated p, t, k, ts and tʃ respectively
[...], tʃ, dʒ before i and y = prepalatal
n before i = ɲ
[...] = [...] when syllabic; when it is after ʃ, ʒ, tʃ, dʒ, it is a fricative with strong
friction
i = ɪ in ai and ei, otherwise = i
e = [...] in ei
  = ɛ in ien and yen, otherwise = e
a = a in au, ai and when before n
  = ɑ when before ŋ or when final
o = [...] when before ŋ
  = [...] when final
u = ʊ in au and eu, otherwise = u
  = [...] in ou
ə = ʌ when before n, otherwise = ə
The quality of vowels varies to some extent with the tones; opener
varieties are generally used with the second, third and fourth tones.


Introduction 


In the Chinese language there are many dialects. Each dialect has its
own separate set of tones. In order to make a careful study of tones and
intonation I chose to work on the Chengtu dialect of Szechuan, this being the
dialect I was brought up on. It is also the mother tongue of the informant,
my father, who was born and brought up in Chengtu. He speaks no other
dialects.

1. Author's note: This is the summary of a Ph.D. dissertation entitled ‘A descriptive
study of the tones in the Chengtu dialect (Szechuan, China) and the intonation of
certain types of sentences’ presented to the University of Edinburgh. A large number of
kymograms and graphs plotted from their tracings, as well as the sonograms,
have been omitted, and the discussion has been shortened. This study I took up at the
suggestion of Mr David Abercrombie, Head of the Department of Phonetics, University
of Edinburgh, and throughout my research I received invaluable advice from him and
Elizabeth Uldall, which I here gratefully acknowledge. I wish also to thank

In this study I am trying to find out (1) whether intonation exists in the
Chengtu dialect; (2) if it does exist what then becomes of the individual
tone, which is one of the basic elements in the word; (3) whether the
individual tone always remains exactly the same no matter if it is spoken in
isolation or in succession, i.e. whether the tone changes if it follows or is
followed by the same or another tone; (4) if it does change when spoken in
succession, if it no longer retains the value which it has when pronounced
by itself, then what the change is like.

In Part I of this study I shall deal with tones, in answer to questions (3)
and (4). In Part II I shall study their relationship to intonation, trying to
answer questions (1) and (2). In studying intonation I first recorded eight
hours of conversation with my father. From the recordings I picked only
sentences whose intonation could be grouped under various emotional
states or attitudes. With the help of a swanee whistle and a tape-repeater
I noted down the intonation. Finally I checked the results on the
spectrograph. In observing the tones and their changes, I first wrote down words
of one syllable, and then phrases containing two or three syllables in all
the possible combinations of tones. They were read aloud and the tones
noted down. The results were then checked on the kymograph and the
spectrograph.


Part 1 Tones and their changes 


When a Chinese character is read aloud the sound produced consists of not
only the consonants and the vowels but also a tone. This tone, which is used
in reading aloud a character in isolation, may be called the Naming Tone
(NT), since it is the tone by which that character is known. It is used when
the character is uttered by itself, not in conjunction with other characters.
But for a character occurring in a phrase or a sentence the naming tone is
often replaced by another tone. The naming tone and those which take its
place are allotones of one toneme. This replacement of one tone by another,
i.e. the interchange of allotones, is called perturbation or tone-sandhi in
this study.

Each tone has its Shape or Feature. This consists of two elements, pitch
and course. By ‘pitch’ I mean whether the tone is high or low or mid. By
‘course’ I mean whether it rises or falls or remains level. The pitch
discussed here is relative and not absolute. It is relative in the sense that every
individual has his or her range of voice.

1 (continued). Professor Y. R. Chao of the University of California, who gave me many important
suggestions through correspondence and private conversations. Without the
cooperation of my father this study would not have been possible.


Monosyllables 

If we divide the pitch of an individual’s voice-range into (1) high, (2) mid-
high, (3) mid, (4) mid-low and (5) low, the four naming tones in the
Chengtu dialect may be described as follows:

1. Tone I, high-rising: it starts between mid-high and mid and rises to
high, e.g. [tʃin] (clear).

2. Tone II, low-falling: it starts somewhere lower than mid and ends
between mid-low and low, e.g. [tʃin] (fine, when referring to weather).

3. Tone III, high-falling: it starts about mid-high and falls to
somewhere a little higher than low, e.g. [tʃin] (to invite).

4. Tone IV, low-falling-rising: it starts about mid-low and falls to
low and then rises ending at about mid or higher, e.g. [tʃin] (to celebrate).
Often the fall reaches so low a point that the voice is almost creaky.
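The relative character of the five pitch bands can be illustrated by mapping a fundamental frequency onto the speaker's own range. The sketch below is an assumption-laden illustration, not the procedure used in the study: the bands are taken to be equal divisions of the range on a logarithmic scale, and the function name and all frequency values are hypothetical.

```python
import math

# The five bands of the text, ordered from the bottom of the range upwards.
LEVELS = ["low", "mid-low", "mid", "mid-high", "high"]


def relative_level(f0_hz, range_low_hz, range_high_hz):
    """Place a frequency in one of five equal bands of a speaker's range.

    Assumption: the bands are equal on a logarithmic frequency scale.
    Values outside the range are clamped to the lowest or highest band.
    """
    if not 0 < range_low_hz < range_high_hz:
        raise ValueError("range_low_hz must be positive and below range_high_hz")
    position = (math.log(f0_hz) - math.log(range_low_hz)) / (
        math.log(range_high_hz) - math.log(range_low_hz))
    position = min(max(position, 0.0), 1.0)
    return LEVELS[min(int(position * 5), 4)]


# The same 180 Hz falls in different bands for two hypothetical speakers.
print(relative_level(180, 100, 200))
print(relative_level(180, 150, 300))
```

For the first hypothetical speaker 180 Hz lies near the top of the range, for the second near the bottom, which is the sense in which the pitch of a tone is relative and not absolute.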


Using Professor Y. R. Chao's method of showing Mandarin tones (1948), 
We may represent the four naming tones of the Chengtu dialect as follows: 


Two-syllabled group


In the two-syllabled group the tone-sandhi is as follows:


1. Toneme I becomes a mid-level tone when it follows Tonemes I or II or
III. But it remains high-rising when it follows Toneme IV or when it
precedes another toneme.


e.g. T. I + T. I       [...on fu]      time
     T. II + T. I      [gue dzia]      nation
     T. III + T. I     [tsau gu]       mushroom
     T. IV + T. I      [mien bau]      bread
     T. I + T. II      [...]           relatives
     T. I + T. III     [fian gan]      Hong Kong
     T. I + T. IV      [... dziau]     religion


2. Toneme II remains low-falling no matter whether it precedes or follows
another toneme. But when it is reduplicated as a form of address or in
baby talk then the second syllable becomes a mid-level tone.


e.g. T. I + T. II      [...]           knowledge
     T. II + T. II     [...]           hairclip
     T. III + T. II    [ie man]        savage
     T. IV + T. II     [di tfiou]      the globe
     T. II + T. I      [li ba]         fence
     [...]             [...]           racket
     T. II + T. III    [tan go]        sweets
     T. II + T. IV     [...]           soap
     T. II (same       [ba ba]         father
     syllable re-
     duplicated)
3. Toneme III remains high-falling when it follows another toneme. The fall,
however, often reaches only to mid-low instead of low. When it precedes
another toneme then it becomes a high-level tone. When it is reduplicated
then the second syllable becomes a low-falling tone.


e.g. T. I + T. III     [... bon]       the origin
     T. II + T. III    [pon i...]      friends
     T. III + T. III   [fiau tfou]     clown
     T. IV + T. III    [fu mu]         parents
     T. III + T. I     [...]           pyjamas
     T. III + T. II    [...]           small flags
     T. III + T. IV    [fiau ...]      stingy
     T. III            [bau bau]       baby
     (reduplicated)


4. Toneme IV becomes a very low-falling tone [...]
when it follows another toneme.


e.g. T. I + T. IV      [...]           weight
     T. II + T. IV     [...iguan]      habit
     T. III + T. IV    [kon pa]        perhaps
     T. IV + T. IV     [yin tfi]       luck
     T. IV + T. I      [di fan]        place
     T. IV + T. II     [...ian pi]     rubber
     T. IV + T. III    [... xai]       Shanghai
     T. IV             [di di]         younger brother
     (reduplicated)


5. When a syllable is not stressed, as often happens with the particles, it is
pronounced so short that we cannot distinguish whether it is going up or
down. In such cases it is called a neutral tone, represented by a dot. The
pitch level of the neutral tone is decided by the toneme preceding it. It is
high when preceded by Toneme I and Toneme III but is mid when preceded
by Toneme II or Toneme IV.


e.g. T. I              [ta di]         his
     T. II             [be di]         white
     T. III            [guei di]       ghost's
     T. IV             [... di]        ugly


The following chart shows the combinations of the two-syllabled group
and their tone-sandhi.

[Chart of two-syllabled tone combinations: not recoverable from the source]
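The rules for the two-syllabled group stated above can be collected into a small lookup. This is a minimal sketch of the rules as they read in this section, not the author's own formalization: the function name is hypothetical, the neutral tone is left out, the result for a following Toneme IV is taken from a partly illegible sentence, and reduplication is handled only for Tonemes II and III, for which the text gives an explicit rule.

```python
# Naming tones of the four tonemes, as described for monosyllables.
NAMING = {1: "high-rising", 2: "low-falling",
          3: "high-falling", 4: "low-falling-rising"}


def two_syllable_sandhi(first, second, reduplicated=False):
    """Return the tones of a two-syllable group as (first_tone, second_tone)."""
    # First syllable: only Toneme III changes when it precedes another toneme.
    first_tone = "high-level" if first == 3 else NAMING[first]

    # Reduplicated forms with an explicit rule in the text (Tonemes II and III).
    if reduplicated and first == second and second in (2, 3):
        return first_tone, {2: "mid-level", 3: "low-falling"}[second]

    if second == 1:
        # Mid-level after Tonemes I, II, III; unchanged after Toneme IV.
        second_tone = "mid-level" if first in (1, 2, 3) else "high-rising"
    elif second == 4:
        # Assumption: wording taken from a partly illegible rule.
        second_tone = "very low-falling"
    else:
        # Tonemes II and III keep their naming tone in second position.
        second_tone = NAMING[second]
    return first_tone, second_tone


print(two_syllable_sandhi(1, 1))
print(two_syllable_sandhi(4, 1))
print(two_syllable_sandhi(2, 2, reduplicated=True))
```

Keeping the first-syllable and second-syllable rules separate mirrors the way the text states them: each toneme is described once for the position before another toneme and once for the position after it.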


Three-syllabled group

In the three-syllabled group the tone-sandhi is as follows:

1. Toneme I remains high-rising when it is in the initial position. It becomes
mid-level when final except in the combinations II + IV + I and III +
IV + I, in which cases it remains high-rising.
When it is in the middle position, then if the first syllable is T. I or T. III
it becomes a mid-level tone; but if the first syllable is T. II or T. IV, it
remains high-rising.


2. Toneme II has no change whatever. It remains a low-falling tone in
whichever position it occurs.


3. Toneme III remains high-falling when final and becomes high-level
when initial.
When it is in the middle, then if the first syllable is T. I or T. III it
remains high-falling, though ending at about mid-high; but if the first
syllable is T. II or T. IV, then it becomes high-level.


4. Toneme IV remains low-falling-rising when initial.
It becomes low-low-falling and is checked by a glottal stop when final.
When it is in the middle position then it becomes low-level.


5. The following positional variants occur:

Initial: T. I remains high-rising.
T. II remains low-falling.
T. III becomes high-level.
T. IV remains low-falling-rising.

Medial: T. I, when the first syllable is T. I or T. III, becomes a mid-level
tone. Otherwise it remains high-rising.
T. II remains low-falling.
T. III, when the first syllable is T. II or T. IV, becomes high-level.
Otherwise it becomes half-high-falling.
T. IV becomes low-level.

Final: T. I becomes mid-level except in the combinations II + IV + I
and III + IV + I, in which cases it remains high-rising.
T. II remains low-falling.
T. III remains high-falling.
T. IV becomes a low-low-falling tone and is arrested by a glottal
stop.
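The positional variants listed above amount to a function of the three tonemes in a group. The sketch below restates that list and nothing more; the function name is hypothetical, and the glottal stop that accompanies final Toneme IV is noted only in a comment.

```python
def three_syllable_sandhi(t1, t2, t3):
    """Return the (initial, medial, final) tones of a three-syllable group.

    Tonemes are given as the integers 1-4.
    """
    initial = {1: "high-rising", 2: "low-falling",
               3: "high-level", 4: "low-falling-rising"}[t1]

    if t2 == 1:
        medial = "mid-level" if t1 in (1, 3) else "high-rising"
    elif t2 == 2:
        medial = "low-falling"
    elif t2 == 3:
        medial = "high-level" if t1 in (2, 4) else "half-high-falling"
    else:
        medial = "low-level"

    if t3 == 1:
        # Exceptions named in the text: II + IV + I and III + IV + I.
        final = "high-rising" if (t1, t2) in ((2, 4), (3, 4)) else "mid-level"
    elif t3 == 2:
        final = "low-falling"
    elif t3 == 3:
        final = "high-falling"
    else:
        final = "low-low-falling"  # arrested by a glottal stop
    return initial, medial, final


print(three_syllable_sandhi(1, 1, 1))
print(three_syllable_sandhi(2, 4, 1))
```

The medial tone depends on the first toneme and the final Toneme I depends on both preceding tonemes, so the three positions cannot be looked up independently of one another.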


The following figure is a chart showing the positional variants of the
three-syllabled group.

[Chart of positional variants of the three-syllabled group, with columns for toneme, naming tone, initial, medial and final: not recoverable from the source]


Examples of three-syllabled group

[The tone letters shown for each example are not recoverable from the source.]

I + I + I         [san ʃyen tan]      soup made of three ingredients
I + I + II        [fu dauan tai]      dressing-table
I + I + III       [kua son mi]        peanuts
I + I + IV        [tuan i dzin]       a long mirror
I + II + I        [dziau ma dai]      peppered chicken
I + II + II       [... li nien]       lunar year
I + II + III      [...an meu ...]     chief of staff
I + II + IV       [dzi du dziau]      Christianity
I + III + I       [...]               [...]
I + III + II      [dzin r xuan]       golden earrings
I + III + III     [... gu fou]        bandman
I + III + IV      [...en li dzin]     binoculars
I + IV + I        [...]               [...]
I + IV + II       [... bu da]         the municipal council
I + IV + III      [... lu ...]        eau-de-Cologne
I + IV + IV       [... dai xuei]      reception party
II + I + I        [... fon ...]       gramophone
II + I + II       [y gan iou]         cod-liver-oil
II + I + III      [...]               [...]
II + I + IV       [...]               mental disease
II + II + I       [dzyo ta tfe]       bicycle
II + II + II      [tie so tfiau]      iron-chained bridge
II + II + III     [xan i li]          the name of a lane
II + II + IV      [be ʃe dʒuan]       the Story of the White Snake, name of a play
II + III + I      [fu li dzin]        the fox spirit
II + III + II     [in xo tfon]        firefly
II + III + III    [xan fu biau]       thermometer
II + III + IV     [pin go fu]         apple tree
II + IV + I       [... fan]           knitwear
II + IV + II      [fan bu tfuan]      canvas bed
II + IV + III     [xo... dou fon]     yellow bean powder
II + IV + IV      [dza xo dien]       a general store
III + I + I       [fuei fien xua]     narcissus
III + I + II      [fou fon tjin]      accordion
III + I + III     [...son dzin li]    general manager
III + I + IV      [ʃuei ien dai]      waterpipe
III + II + I      [... fuo gau]       lipstick
III + II + II     [fuei lon tou]      water tap
III + II + III    [li tie guai]       Cripple Lee, a legendary character
III + II + IV     [fien uei dzin]     microscope
III + III + I     [bau fien fian]     safe-box
III + III + II    [pau ma tfan]       race course
III + III + III   [bau ...ou dan]     the Conservative Party
III + III + IV    [... go dien]       fruit shop
III + IV + I      [da dzz di]         typewriter
III + IV + II     [iau tsai tfan]     vegetable market
III + IV + III    [li bai ...]        Friday
III + IV + IV     [gan mien dzan]     rolling pin
IV + I + I        [dien don pa...]    bulb for lamp
IV + I + II       [...au xua ...]     beggar woman
IV + I + III      [uan fon ...]       an ancient kind of megaphone
IV + I + IV       [uai dziau bu]      Foreign Ministry
IV + II + I       [ian pi dzin]       rubber band
IV + II + II      [ian ia tfuan]      ivory bed
IV + II + III     [... ion dan]       the Free Party
IV + II + IV      [ti tou ...]        barber
IV + III + I      [... a ba]          trumpet
IV + III + II     [dien xo lu]        [...] fire
IV + III + III    [dzau dgiou ...an]  brewery
IV + III + IV     [dzau dzu dgiau]    Bishop Chao
IV + IV + I       [kuai dzi s...]     accountant
IV + IV + II      [...ən dan dsie]    Christmas
IV + IV + III     [da ʃan xai]        Greater Shanghai
IV + IV + IV      [da dziau fou]      professor


The results of this investigation may now be summarized:


1. There are ten principal allotones for the four tonemes.
They are:

Toneme I      (1) high-rising
              (2) mid-level
Toneme II     (3) low-falling
Toneme III    (4) high-falling
              (5) high-level
              (6) half-high-falling
Toneme IV     (7) low-falling-rising
              (8) low-low-falling
              (9) low-level
              (10) neutral tone

2. Toneme II always remains low-falling.

3. Toneme I and Toneme IV remain unchanged when they are in the
initial position.

4. When Toneme I goes through perturbation the naming tone is always
replaced by a mid-level tone.

5. Toneme III remains unchanged when it is in the final position.

6. The naming tone of Toneme III is replaced by a high-level tone when
it is initial in a three-syllabled group. It is replaced by a half-high-falling
tone when it is the middle syllable.

7. The naming tone of Toneme IV is replaced by a low-level tone when it
is in the middle of a three-syllabled group. It is replaced by a low-low-
falling tone checked by a glottal stop when it is the final syllable.*

* In the three-syllabled group the first syllable has the strongest stress, the last syllable
secondary, and the middle syllable has the least stress.


I also worked on the four-syllabled group though without the help of the
spectrograph. A total of 256 four-syllable combinations were studied and
the results are as follows:


1. Toneme I remains high-rising when it is in the initial position. When it
is the second or third or fourth syllable then it becomes mid-level.


2. Toneme II remains low-falling in whatever position it occurs.


3. Toneme III remains high-falling when final. It becomes high-level in
any other position.


4. Toneme IV remains low-falling-rising when it is in the initial position,
but becomes low-level when it is the second or third syllable, and
low-low-falling when it is in the final position.

5. The following positional variants occur:

Initial: T. I remains high-rising.
T. II remains low-falling.
T. III becomes high-level.
T. IV remains low-falling-rising.

Medial: T. I becomes mid-level.
T. II remains low-falling.
T. III becomes high-level.
T. IV becomes low-level.

Final: T. I becomes mid-level.
T. II remains low-falling.
T. III remains high-falling.
T. IV becomes low-low-falling and is arrested by a glottal stop.


The following figure is a chart showing the positional variants of the
four-syllabled group.

[Chart of positional variants of the four-syllabled group, with columns for toneme, naming tone, initial, medial and final: not recoverable from the source]
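In the four-syllabled group each position has a fixed variant for each toneme, so the rules reduce to three lookup tables, and the figure of 256 combinations is simply four tonemes in each of four positions. The following sketch restates the positional list under that reading; the table and function names are hypothetical.

```python
from itertools import product

# Positional variants of the four tonemes (1-4) in a four-syllable group.
INITIAL = {1: "high-rising", 2: "low-falling",
           3: "high-level", 4: "low-falling-rising"}
MEDIAL = {1: "mid-level", 2: "low-falling",
          3: "high-level", 4: "low-level"}
FINAL = {1: "mid-level", 2: "low-falling",
         3: "high-falling", 4: "low-low-falling"}  # T. IV ends in a glottal stop


def four_syllable_sandhi(tonemes):
    """Return the tones of a four-syllable group given its four tonemes."""
    t1, t2, t3, t4 = tonemes
    return INITIAL[t1], MEDIAL[t2], MEDIAL[t3], FINAL[t4]


# Every ordered choice of four tonemes: 4 ** 4 combinations.
combinations = list(product((1, 2, 3, 4), repeat=4))
print(len(combinations))
print(four_syllable_sandhi((3, 3, 3, 3)))
```

Unlike the three-syllabled group, no variant here depends on a neighbouring toneme, which is why plain per-position tables are enough.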


` Part 2 Intonation and its relationship {о tones. 


| 


Intonation is the fluctuation of the voice pitch as applied to the whole sentence. It is the sentence melody and is superimposed on the sentence as a whole. Tones apply to individual syllables whereas intonation covers the whole sentence. Unlike tones, furthermore, a change of intonation does not affect the lexical value of words. It only adds shades of meaning to the sentence spoken and brings out the attitude of the speaker and the emotional state he is in.


Every community has its own intonation pattern, i.e. its own rules of changing the voice pitch when uttering the sentence. The fluctuation of the voice pitch of the individual follows, consciously as well as unconsciously, these patterns. Those whose intonation does not coincide with these patterns are considered foreign speakers. (‘Foreign’ in the broad sense, meaning ‘strange’ or ‘peculiar’ or ‘alien’.) Those who are not familiar with these patterns naturally miss the subtle ‘overtones’ of the sentence spoken.

One would imagine the pitch of each syllable in a tonal language to be fixed beforehand, and therefore that it would be difficult for a tonal language to have intonation. But on closer examination we find pitch phenomena which we can only regard as ‘intonation’ superimposed upon the tonal system. Apart from the change due to tonal environment as shown above, there remain characteristics and modulations of the voice pitch which bring out different shades of meaning. The fact is that the sentence may be spoken in different ‘keys’ when representing different attitudes, and that the syllables go through perturbation (see under ‘perturbation’ below), thus giving the whole sentence a rising or falling tune.

I shall now try to describe the intonation of some types of sentences in the Chengtu dialect, the circumstances under which they are used and the shades of meaning they convey. According to the data which I have assembled, intonation in the Chengtu dialect may be regarded as consisting of three factors:

1. The pitch level on which the sentence is spoken. This may be divided into high, mid-high, mid, mid-low and low.

2. The range of pitch the sentence covers. The range may be divided into wide, medium and narrow.

3. Perturbation of the final syllable. In connected speech, syllables often form groups of two, three or four and their perturbation follows the patterns discussed in Part 1. It is the final syllable alone, however, which gives the clue to the listener whether the sentence is a question or a statement, whether it has a rising or falling tune. I must here explain that this rising or falling has no reference to the pitch of the preceding syllables, but only to the pitch of the final syllable itself. Thus whether I regard a sentence as having a rising or a falling tune depends on whether, after undergoing perturbation, its final syllable is a rising or falling tone. In the case of a rising naming tone of the final syllable being replaced by its level allotone, I classify the sentence as having a ‘falling’ tune; and in the case of a falling naming tone being replaced by a level allotone I classify the sentence as having a ‘rising’ tune.
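The classification rule just stated can be written out as a small decision procedure. The sketch below is only an illustration under stated assumptions: tone labels are taken to be hyphenated strings such as ‘low-falling-rising’, the last element of the label is taken to give the final movement of the tone, and the function names are hypothetical.

```python
def final_movement(tone):
    """Return the last movement named in a tone label.

    Assumes labels like 'high-rising', 'mid-level' or
    'low-falling-rising', whose final element is the movement.
    """
    movement = tone.split("-")[-1]
    if movement not in ("rising", "falling", "level"):
        raise ValueError(f"unrecognised tone label: {tone!r}")
    return movement


def classify_tune(naming_tone, realised_tone):
    """Classify a sentence tune from its final syllable.

    naming_tone   -- the tone of the final syllable in isolation
    realised_tone -- the tone actually heard after perturbation
    """
    movement = final_movement(realised_tone)
    if movement in ("rising", "falling"):
        # The realised tone itself rises or falls.
        return movement
    # Level allotone: the tune is named by what the naming tone lost.
    lost = final_movement(naming_tone)
    if lost == "rising":
        return "falling"
    if lost == "falling":
        return "rising"
    raise ValueError("a level naming tone is not covered by the rule")
```

A high-rising naming tone realised as mid-level therefore counts as a falling tune, while a low-falling naming tone realised as low-level counts as a rising tune.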




The examples are put on music manuscript paper. The four spaces and the blank above the top line of each staff represent the pitch levels (high, mid-high, mid, mid-low, and low). The intonation of the sentence is marked above the phonetic transcription. The mark [/] represents rising, [\] represents falling and [-] represents level. The difference in length of the marks represents roughly the relative time taken over the syllable uttered. In rapid conversation many words are unstressed and become neutral tones. These are marked with dots. The Arabic numerals under each syllable represent the toneme to which it belongs.

The examples given are all picked out from the eight hours’ conversation I recorded. Unfortunately there is scarcely one single sentence among them that is spoken with two different types of intonation. But against this disadvantage may be set the fact that all the examples are from real life situations; none of them have been spoken with ‘simulated emotions’ or read aloud, or made up for the purpose of illustrating intonation.

Rhythm, stress, tempo and voice quality also help to indicate the mood 
or the emotional state of the speaker. Where these elements seem significant, 
I have also touched on them in a very general way. 


[Examples in staff notation. The staves and most of the phonetic transcription are not reproduced here; the English glosses are listed by sentence type.]

Ordinary statements
(He) went at five o'clock.
It's dull today.
Hupeh is the earliest.
I have not read it yet.

Ordinary questions
What reason (shall I) say?
Did he eat again when he came back?
Who broke it?
How did it get loose?

Emphatic sentences
Not a bit wrong.
Of course it is Shantung dialect.
Szechuan started much earlier.
I think there must be something wrong.

Sentences expressing emphatic approval
Yes, that is right, quite right!
Of course they're selling a lot.
You are cruel!
Only he has the authority to change it.

Sentences expressing annoyance or vexation
How can one eat this?
It's not well folded!
You just reason it out!
There aren't any vegetables!

Sentences expressing awe
It's really like a miracle!
It's really good!
They are all huge!
This is a great mistake!

Sentences expressing contempt
How can they finish censoring it?
You'll have to predict correctly.
How is this possible, indeed?

Sentences containing a protest
Yes, yes, but who's afraid? It's because ...
Surely you can translate it, yes, yes, yes.
If you don't even worship your ancestors then how can you ...
How did he know him?

Sentences expressing surprise
Hasn't he received it yet?
Where did he put it?
How could he see all this?
Why wasn't mine pure?

Sentences implying a dismissal of the topic
(It's) pretty good.
(He) already escaped.
(I) can't remember.
Walk straight down.

Unfinished sentences
Not long afterwards ...
Therefore, people's faces ...
So long as the man is there ...




Ordinary sentences

By ordinary sentences I mean statements and questions used in polite conversation. The speaker is good-humoured and in a [...] mood. He is emotionally placid and calm, and is [...] says. He is merely stating a fact, not giving it particular emphasis.

Statements

The pitch level of this type of sentence is between mid and low. The range is medium.

If the statement consists of several high tones, i.e. high-rising and high-falling, then each one of them starts on a lower pitch than the preceding one. If there are several breath groups in one sentence, then the first breath group is higher in pitch than the following ones.

This type of sentence has a falling tune. If the sentence ends in a high tone while the rest of the sentence are low tones the high tone naturally remains higher than the low ones, but even the high tone has an inclination to fall. The perturbation of the final syllable is as follows:

Toneme I (naming tone: high-rising) becomes mid-level.
Toneme II (naming tone: low-falling) remains low-falling.
Toneme III (naming tone: high-falling) remains high-falling.
Toneme IV (naming tone: low-falling-rising) becomes low-low-falling checked by a glottal stop.

As will be seen later, this is one of the two patterns for the perturbation of the final syllable, and is shared by all sentences with a falling tune. Sentences with a rising tune follow the other pattern.
Questions

The general pitch level of the questions is the same as that of the statements, namely, between mid and low. The range is medium.

This type of sentence has a rising tune. The perturbation of the final syllable is as follows:

Toneme I (naming tone: high-rising) remains high-rising and often ends higher than usual.
Toneme II (naming tone: low-falling) becomes low-level.
Toneme III (naming tone: high-falling) becomes high-level.
Toneme IV (naming tone: low-falling-rising) becomes low-rising.

This is the pattern for the perturbation of the final syllable in sentences with a rising tune.
In spoken Chinese, sentences often end with particles like [a], [san], [le], [lo]. These particles are meaningless by themselves, but they play an important part in bringing out the intonation of the sentence and thus denote whether the sentence is a question or a statement. If the particle is pronounced on a high pitch level or with a rising tone, then the sentence is a question. If on the other hand the particle is pronounced with a falling tone, then the sentence is a statement. It may be asked whether it is the particles that fix the intonation of the sentence or whether they merely bring out the intonation more clearly to the listener by indicating whether the sentence has a rising or a falling tune. The latter explanation seems a more plausible one since the same particle can be used in different types of sentences and it is then pronounced with different tones.

These particles are often also used with unfinished sentences, in which case they are pronounced with a rising tone, and give a sense of suspense to the listener.

On the other hand, questions containing Verb-no-Verb (e.g. ‘go or not go?’) or Adjective-no-Adjective (e.g. ‘good or not good?’) constructions have a falling tune like that of the ordinary statement.


Emphatic sentences

By emphatic sentences I mean statements in which the speaker gives emphasis or prominence to some specific point. He is concerned to bring this into contrast with other points or to intensify its significance. But emotionally he is not agitated. In ordinary speech, Chinese syllables are more or less evenly stressed. But in this type of sentence there is often one particular word or syllable which receives an extra stress, the word being the point emphasized. This stress on the part of the speaker seems to imply: ‘This is what I mean.’

The pitch level is between mid-high and low.

The range is wide.

The perturbation of the syllable receiving extra stress is as follows:

Toneme I (naming tone: high-rising) remains high-rising and ends yet higher than its normal pitch in an ordinary statement.
Toneme II (naming tone: low-falling) falls yet lower.
Toneme III (naming tone: high-falling) becomes high-level.
Toneme IV (naming tone: low-falling-rising) remains low-falling-rising but ends in a higher pitch than usual.

This type of sentence has a falling tune. The perturbation of the final syllable is the same as that in the ordinary statement.

Sentences expressing certain attitudes or emotional states

When we speak, we may merely be stating a fact or giving special emphasis to certain points. But sometimes we may want to do more; we want also to convey our personal reactions or attitudes to our listener or to express our feelings as well. Under these circumstances our emotion is a predominant element; therefore the intonation we use is different from that we use when speaking under unemotional circumstances.

In this section I shall describe the intonation of several types of sentences which express different attitudes or emotions. The seven types of sentences that I choose are:

1. Sentences expressing emphatic approval.
2. Sentences expressing vexation.
3. Sentences expressing awe.
4. Sentences expressing contempt.
5. Sentences containing a protest.
6. Sentences expressing surprise.
7. Sentences implying dismissal of the topic.

1. Sentences expressing emphatic approval:

By these I mean statements in which the speaker is very sure of himself and at the same time is in perfect accord with the last speaker. There is a sort of finality in his sentence. It implies ‘that’s that’ or ‘I know it is so’. In showing approval the sense involved is ‘Quite right!’ or ‘That’s just it.’

The pitch level of this type of sentence is between mid-high and low. The range is wide. The tune used is a falling one. The perturbation of the final syllable is the same as that of the ordinary statement.


2. Sentences expressing vexation or annoyance:

This type of sentence is used when the speaker is in a bad mood. He is trying to start an argument. What is implied seems to be ‘Now I ask you ...’ or ‘It’s all your own fault, so ...’ or ‘How can you ask such a stupid question?’

The pitch level of this type of sentence is between high and mid. The range is medium. It has a rising tune. The perturbation of the final syllable is the same as that in the ordinary question.


3. Sentences expressing awe:

This type of sentence is used when the speaker wants to show that what he is talking about is something of great importance. He wants to impress his listener and at the same time to convey the idea that he himself is impressed by what he is trying to tell. He aims to create awe among his listeners. What is implied is ‘This is something wonderful!’ or ‘That is terrific!’

This type of sentence is spoken on a low pitch, varying between mid-low and low. The range is narrow and all the tones seem to be compressed together; therefore there is a tendency for all the tones to become level. Both the rising and the falling are very slight. The sentence has a falling tune. The perturbation of the final syllable is the same as that of the ordinary statement.

The voice quality in this type of sentence is often ‘breathy’ or ‘husky’.


4. Sentences expressing contempt:

This type of sentence is used when the speaker is in a contemptuous frame of mind. He is ready to snap at the person spoken to and close the conversation as soon as possible. The sentence implies ‘This is impossible’, or ‘What nonsense you are talking’ or ‘Let’s proceed no more.’

The pitch level is between mid-high and low. The range is wide. The sentence has a falling tune and the perturbation of the final syllable is the same as that of the ordinary statement.

The characteristic feature of this type of sentence is that one syllable in the sentence is always lengthened. The syllables coming before or after the lengthened one are usually huddled together and spoken quickly; thus they often become neutral tones.


5. Sentences containing a protest:

This type of sentence is used when the speaker is greatly agitated or excited. It is often used in argument when the speaker hopes to shout his opponent down. Unlike sentences expressing vexation, the speaker is not deliberately starting an argument. On the contrary, he is the victim; he is being provoked. He is anxious to make himself understood. Under these circumstances, the listener is often also trying to talk at the same time; the result therefore is that this type of sentence is usually spoken on a high pitch level, between high and mid-high. Sometimes the sentence may start on a high pitch level and then fall to low, but it is the high-pitched part of the sentence that contains the protest; by the time the voice pitch falls to low, the speaker's emotional state has returned to normal. It is not infrequent that the sentence is left unfinished.

The range of this type of sentence is narrow. It has a rising tune and the perturbation of the final syllable is the same as that of the ordinary question.
6. Sentences expressing surprise:

This type of sentence is used when the speaker is taken by surprise or is [...]. It implies incredulity as well. It means ‘Really?’ or ‘Can this be true?’

The pitch level is between high and mid-low. The sentence starts on a high pitch and gradually falls. The range is wide. It has a falling tune. The perturbation of the final syllable is the same as that of the ordinary statement.




7. Sentences implying a dismissal of the topic:

This type of sentence is used when the speaker is preoccupied with something [...]. It does not mean that the speaker wants to [...], nor is this type of sentence as [...] as [...] and vexation. In this case, the [...] another topic. It is used to dismiss the subject [...]. It implies ‘Never mind this, it’s not [...]’.

The pitch level is between mid-low and low. The range is narrow; therefore the rising and falling of the tones are very slight. The sentence has a falling tune and the perturbation of the final syllable is the same as that of the ordinary statement.


From the results given above, we may draw the following conclusions:

1. There is a definite relationship between pitch and the type of sentence. For instance, sentences containing a protest are spoken on a high pitch level whereas sentences implying dismissal of the topic are spoken on a low pitch level. But it is difficult to make any general statements on this relationship.

2. (a) The range of the pitch varies with the type of sentence. Sentences containing a protest and sentences implying dismissal of the topic have completely different pitch levels, yet both have a narrow range. On the other hand, emphatic sentences and sentences expressing contempt, for example, both have a wide range. Ordinary statements and questions have a medium range. Thus, the range is at least a clue to the emotional state of the speaker.

(b) When the range of a sentence is narrow, there is a tendency for all the tones to become level, i.e. the rise and fall of the tones become very slight.

3. (a) The perturbation of the tones of the final syllable in the sentence follows two distinct patterns. In sentences with a ‘rising’ tune, the perturbation of the final syllable is as follows:

Toneme I (N.T. high-rising) remains high-rising.
Toneme II (N.T. low-falling) becomes low-level.
Toneme III (N.T. high-falling) becomes high-level.
Toneme IV (N.T. low-falling-rising) becomes low-rising.

In sentences with a ‘falling’ tune, the perturbation of the final syllable is as follows:

Toneme I becomes mid-level.
Toneme II remains low-falling.
Toneme III remains high-falling.
Toneme IV becomes low-low-falling.




(b) The two tunes are used for different types of sentences. The rising tune is used for

1. Questions other than those containing Verb-no-Verb and Adjective-no-Adjective constructions.
2. Sentences expressing vexation.
3. Sentences containing a protest.
4. Unfinished sentences.

The falling tune is used for

1. Ordinary and emphatic statements and questions containing the Verb-no-Verb and Adjective-no-Adjective constructions.
2. Sentences expressing emphatic approval.
3. Sentences expressing awe.
4. Sentences expressing contempt.
5. Sentences expressing surprise.
6. Sentences implying dismissal of the topic.
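The conclusions above can be gathered into two small tables: one giving the pitch level, range and tune reported for each sentence type, and one giving the perturbation of the final syllable under each tune. The sketch below is an illustrative summary only; the dictionary keys and the function name `final_tone` are hypothetical, and unfinished sentences and Verb-no-Verb questions are left out because the text reports only their tune.

```python
# (pitch level, range, tune) for each sentence type, as reported in
# the descriptions above. Key names are illustrative assumptions.
SENTENCE_TYPES = {
    "ordinary statement": ("mid to low", "medium", "falling"),
    "ordinary question": ("mid to low", "medium", "rising"),
    "emphatic": ("mid-high to low", "wide", "falling"),
    "emphatic approval": ("mid-high to low", "wide", "falling"),
    "vexation": ("high to mid", "medium", "rising"),
    "awe": ("mid-low to low", "narrow", "falling"),
    "contempt": ("mid-high to low", "wide", "falling"),
    "protest": ("high to mid-high", "narrow", "rising"),
    "surprise": ("high to mid-low", "wide", "falling"),
    "dismissal": ("mid-low to low", "narrow", "falling"),
}

# Perturbation of the final syllable under each of the two tunes.
FINAL_PERTURBATION = {
    "rising": {
        "I": "high-rising",
        "II": "low-level",
        "III": "high-level",
        "IV": "low-rising",
    },
    "falling": {
        "I": "mid-level",
        "II": "low-falling",
        "III": "high-falling",
        "IV": "low-low-falling",
    },
}


def final_tone(sentence_type, toneme):
    """Surface tone of a final syllable of the given toneme."""
    _level, _range, tune = SENTENCE_TYPES[sentence_type]
    return FINAL_PERTURBATION[tune][toneme]
```

Laid out this way, the point of conclusion 2(a) is easy to check: protest and dismissal share a narrow range although their pitch levels and tunes differ.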
Tone-sandhi has already been studied in many Chinese dialects. It would be desirable for similar work to be done on intonation. A very interesting question is whether in other dialects intonation is also indicated by the perturbation of one particular syllable, which in the case of the Chengtu dialect is the final syllable. It would also be interesting to know if the resulting tunes could be divided into two or more patterns. If a number of other dialects could be studied along similar lines to the present inquiry, we could perhaps come to a general explanation of tonal behaviour and intonation in the Chinese dialects.

Summary

From the above we conclude that

1. Tones pronounced in isolation behave differently from those pronounced in connected speech. In connected speech they [...]. This is usually governed by the position they occupy in the group and by the tonal environment. It may also be governed by grammatical structure, though this does not form part of my present inquiry.

2. Besides the four naming tones in the Chengtu dialect, the author found six other tones which, together with these naming tones, could be grouped into four tonemes.

3. Intonation does exist in the Chengtu dialect. It is superimposed on the sentence as a whole. And it is this superimposed intonation that modifies the individual tones and not the tones themselves that decide the intonation of the sentence.

Reference

Chao, Y. R. (1948), Mandarin Primer, Harvard University Press.


25 Einar Haugen and Martin Joos

Tone and Intonation in East Norwegian

Einar Haugen and Martin Joos, ‘Tone and intonation in East Norwegian’, Acta Philologica Scandinavica, vol. 22, 1952, pp. 41-64.


Authors' note

The purpose of this study was to provide (for the first time) spectrographic evidence on the tonal patterns of a Scandinavian dialect, in order to illustrate my contention that the so-called ‘word tones’ could only be understood in relation to the linguistically significant stressed syllables. As I had written in 1949, ‘the difference between two significantly contrastive tones may consist of nothing more than a different timing of the tonal curve in relation to the syllabic stress’. In the absence of any satisfying definition of stress, I took it to be located in those syllables which in Norwegian have either a long vowel or a short vowel followed by a long consonant: V:(C), VC:(C). I called this minimally necessary portion of the stressed syllable, V: or VC:, its ‘core’ (p. 430) and showed that the two tones contrasted in the timing of their highs and lows in relation to this core. Much more is now known about the nature of the tones in various Scandinavian dialects, and questions have been raised about the very existence of what I called ‘stress’ (e.g. by Fintoft, 1970, esp. p. 37). Regardless of what the acoustic correlates of stress will turn out to be, it cannot be questioned that the (usually) initial syllable of a Germanic language like Norwegian is linguistically primary. Tone can hardly be the sole determinant of stress, since there are two different tones in syllables which native speakers perceive as being identically stressed.

Our article was also concerned with (sentence) intonation. Every word in the language has a [...] tone (which is non-contrastive in monosyllables). In native words [...] one is generally [...] identical with prim[...] marked tone, peculiar to Sc[...]. See my article (1967).

A few small corrections have been made in the text of the present article, [...] the addition of some more recent bibliographical references. [...] has replaced ‘contour’ where this is contrasted with [...], and the measures have been marked in the transcription, after being [...] defined as juncturally bounded.

E.H., 1971.


The purpose of the present study is to analyse the function of pitch in a Norwegian utterance. This topic has a particular interest for general linguistic theory because of the so-called ‘word tones’, which have been compared with the tones of various African, Indian and Oriental languages. In the literature the tones bear various names, but we shall here call them ‘accents’ in order not to prejudge their nature. The simpler of the two, which is closest to the pitch patterns of other Germanic languages, will be called accent 1; the more complex, which is peculiarly Scandinavian, will be called accent 2. As early as 1860 the Norwegian phonetician Johan Storm succeeded in identifying and describing the musical difference between such otherwise identical words as bønder ‘farmers’ (accent 1) and bønner ‘beans’ (accent 2). He elaborated his descriptions in later publications, but never went beyond an auditory determination expressed in musical notes. Instrumental evidence was brought to bear in the 1920s in a series of studies by Ernst W. Selmer, who used a kymograph to analyse the word tones of the dialects spoken in Oslo, Bergen, Stavanger, Sunnmore and the Faroe Islands. Selmer was not greatly interested in the relation between these tones and the intonation of the whole utterance, but this was the chief topic of Ivar Alnæs, whose book Norsk Setningsmelodi of 1916 is still the only one devoted to the study of Norwegian sentence intonation. Important additions to our knowledge concerning the function of the tones have been made in a series of articles by Olaf Broch (e.g. 1935, 1937, 1939, 1944), in which the relation between tonal and rhythmic patterns is clarified. We will not here be concerned with the large body of literature that discusses the historical origin of the tones or their distribution in the present-day vocabulary. [...] out that parallel phenomena are to be [...]. A number of important [...] have made their appearance in [...] (Bo, 1933; Hansen, 1[...]; Smith, 1944; Bjerrum, 1948; Meyer, 1937; Ekblom, 1933; Stalling, 1935).

1. Cf. Pike [...]; [...] and Swedish are excluded by definition.

2. Cf. also [...]. For the local studies by Selmer, see the original of this Reading, p. 41, footnote 3.

3. See now especially the researches of [...] Malmberg and Kerstin Hadding on South Swedish, Carl Borgstrøm, Martin Kloster Jensen and Arne Vanvik on Norwegian and the mathematical models of Sven Öhman, all listed in the comprehensive bibliography of Fintoft (1970).




The method adopted in this study will be a combined phonetic and phonemic analysis. New, precise data derived from instrumental analysis will be presented, and then analysed by structural methods to derive the relevant units of Norwegian pitch. Wherever they are pertinent, the views of earlier scholars will be discussed and either accepted or rejected. Some of the views here presented are only tentative, since the material available is still small, and the methods of structural linguistics are still far from adequate for the solution of such difficult problems as are offered by pitch. But it is hoped that some steps forward may be made toward a revised approach to these problems.

One of the difficulties facing the investigator of intonation has been that of gaining an objective picture of the tonal movement, particularly in longer utterances. Much of this has been eliminated by the use of the spectrograph, which makes it possible to determine the melodic movement with less effort than earlier (Joos, 1948). At the same time it must be recognized that from a linguistic point of view the [...] registered by the ear alone are of the highest importance and cannot be eliminated in favor of mechanical recordings, no matter how perfect. For the present investigation a text was chosen which should render natural Norwegian speech, a phonograph recording by the actor Hauk Aabel.⁵ The passage analysed is spoken in a rapid, conversational manner, with great variations of emotional expression, including examples of questioning and exclamations. The dialect is standard colloquial Oslo speech, with a lapse into substandard in his imitation of the chauffeurs.

The first step was to prepare a phonetic transcription, marking the tones and stresses. The passage was then analysed by means of the spectrograph, and the intonational information transferred from the spectrograms to semi-logarithmic paper. Photographs of the six charts resulting from this analysis accompany the present article. A new phonetic analysis was then made from the spectrograms, which is here presented along with the standard Norwegian spelling of the text. Only those sounds are shown that could be positively identified in the spectrograms. Stress is marked by ['] primary stress with accent 1, ["] primary stress with accent 2, [,] secondary stress; these are auditorily [...]. [...] by a colon after the sound affected: [a:]. The sound values of the vowel symbols are those of East Norwegian, [a] low back, [å] mid back over-rounded, [o] high back over-rounded, [u] high central over-rounded, [y] high front half-rounded. Word division is included for convenience in reading; the numbers in parentheses represent the pause groups. Contour junctures (see below) are marked by perpendicular bars.

5. Hauk Aabel, b. 1869 at Sendfj[...] thereafter, according to Hvem er Hvem? 1948 [...] Norsk Forfatter-Lexicon 1.1 (Kristiania, 1885).


Et Hekseskudd
Hauk Aabel

(1) En dag som jeg går og spaserer på Karl Johan, så får jeg et hekseskudd.
en da:g|sm e 'gàá:r|à spa'se:ror|pà karl ja ihan: |så "ja: je t |" hekseyskud:

(2) Ganske plutselig, (3) uten noen slags forutgående fornemmelser.
"gansko| "plutsli "n tn |ndon sjlaks| \farut, gdana| får 'nemlsər

(4) Og der stod jeg uten å kunne røre meg av flekken. (5) Først tenkte
å 'de:r|'sto: jæ|"u:tn à kuno|'ra:ro me ja !flek:on forst "tengt

jeg: (6) Ryggen må være gått av. (7) Du får vente litt, så kanskje den
je Is am må vero|"gát: 1a: du få"vent lit:|sd "kàso n|

gror i sammen igjen. (8) Jeg stod midt i verste trafikken. (9) Rett imot
'gro:arisamn jen je 'sto:|'mit: | "versto |ra'fik :an "ret: | mo

meg kom det en bil, og rett bak meg en. (10) Så tenkte jeg: (11) Du får
me|küm de n bi:l| a ret 'ba:k me en sá"tengt je du fà

rekke høyre armen i været, så stanser de kanskje. (12) Ja, jeg så gjorde,
rek:o|'hayoro|'armon|i :'væ:rə|så "stansor ikansj  \jae|'jea|'sa| "jo-ra ~

(13) og bilene stanset ganske riktig. (14) Sjåførene skrek og bar seg:
å "bi:Ino|"stansot | "ganska|"rikti sj\fo:rna|'skare:k là 'ba:er s

(15) «Se til å komme unna der din idiot!» (16) Men det var meg ikke
"sje told Като "un:a ræ| in "id:jot mn də'va: mei ke|

mulig å gå. (17) Den høyre armen min var nesten lam. Så tenkte jeg:
'muli| а 'gå dən|'høyərə|'armən min|var "nestn | lam: |så “tengt je

(18) Nå gjelder det å få opp den venstre (19) før de begynner å kjøre.
'ná:|jelzar|  d"fd,op:|n 'venstro Yo:rli 'bjyn:orle "40:70

(20) Men jeg rakk det ikke. (21) Begge bilene kjørte over meg.
me je rak: d а Nbeg:a|"bi:lna| "co: 10: ma

(22) Trafikken stanset og en av sjåførene kom bort til meg. (23) «Lever
tra'fik:ən|"stansət|å 'eznla sja\fa:rna|kdm "Богі: 2 mæ 'le:vər

De?» sa han. (24) «Ja-a, jeg tror det,» sa jeg. (25) «Men De får endelig
ї san Ya: |j? тол ә sa je mn di fér "endli

ikke bry Dem noe om meg. Jeg har — (26) hekseskudd, jeg.» (27) Den
Ka|'bryd:am пој om mei | ja а прекза Кид: Je den

ene av sjåførene — han så riktig så snill ut — (28) han la meg pent opp i
есп alsja'fo:rna | an 'sá:|"rikti|sá 'snil:|'u:t han Ча: те! ре:гт |op

bilen sin og (29) kjørte meg (30) like hjem.⁶
еп зїп|й — "co:rt те "li:k Jem:


f 
We shall now turn to the six plates which depict the tonal movement of
this passage. The abscissa is calibrated into centiseconds, the
ordinate into vibrations per second. The scale on the left-hand side
translates these into musical notes, while the scale on the right-hand
side gives the actual number of vibrations. Small perpendicular lines
crossing the main curve show the approximate boundaries between
neighboring segments. The whole text is divided into pause groups from
one to thirty; these will here be called utterances and references to them
will be by number and centisecond, e.g. the d of en dag begins at U 1.18.
The utterances will be convenient divisions from which to start our
analysis, but it is evident that they correspond only in part to the units
that would result from a grammatical-semantic analysis. In some cases the
speaker's speech has carried him over from one grammatical unit to another
without pause (1, 7, 9, 11, 17, 22, 25); in others an accidental or
deliberate hesitation has broken up an obvious unit (2, 3, 26, 29, 30). But
the great majority of the utterances do coincide with grammatical units,
and the tonal contours do not run over from one to the next. For our
purposes we may thus consider them as autonomous utterances, and study
their characteristics with a view to further reduction of their extent.

An inspection of the curves shows that most of the musical movement
is confined to a band from about d (150 ~) to b (250 ~), or a musical sixth;
near the end, however, it sinks to B-g (125 ~-200 ~). This may be
regarded as the normal speech range of this speaker, and will of course
vary greatly from speaker to speaker. The interval of a sixth corresponds


6. Translation: (1) One day as I am out walking on Karl Johan (street), I get an
attack of lumbago. (2) Quite suddenly, (3) without any kind of previous sensations.
(4) There I stood without being able to stir from the spot. (5) First I thought: (6) My
back must have cracked in two. (7) You better wait a little, and maybe it will grow
together again. (8) I stood right in the worst of the traffic. (9) Straight toward me
came a car, and right behind me one. (10) Then I thought: (11) You better put your
right arm up and maybe they'll stop. (12) Well, I did so, (13) and sure enough, the cars
stopped. (14) The chauffeurs screamed and yelled: (15) ‘See that you get out of there,
you idiot!’ (16) But I just could not move. (17) My right arm was almost paralysed.
Then I thought: (18) If I can only get my left one up (19) before they start to drive.
(20) But I didn't make it. (21) Both the cars ran over me. (22) The traffic stopped and
one of the chauffeurs came over to me. (23) ‘Are you alive?’ he said. (24) ‘Ye-es, I
think so,’ I said. (25) ‘But whatever you do, don't bother about me. I have —— (26) a
touch of lumbago!’ (27) One of the chauffeurs — he looked really very kind — (28) he
laid me nicely in his car and (29) drove me (30) straight home.


418 Perturbations 


[Six plates showing the tonal curves of the passage, utterances 1-30. Abscissa: time in centiseconds; ordinate: vibrations per second, with the corresponding musical notes on the left-hand scale.]


Beer ae ee a O 


for the two musical tones. For each such rise there is also a corresponding 
fall, so that in an average utterance the successive high points will be 
approximately equal. Storm described accent 1 as rising a musical fourth;
accent 2 as falling a third and rising a fourth (Storm, 1884, p. 44). Alnæs
estimated the rise in both cases to be a sixth, the fall in accent 2 to be a
third (Alnæs, 1925, p. 27). Selmer found an average rise of a fifth, though it
could exceed the octave in some cases, with a slightly smaller fall in accent 
2 (1920, pp. 65, 74). We may thus expect that the intervals between Aabel’s 
high and low points in the curve are those of the rising and falling accents, 
or word tones. We expect to find an accent 1 which is rising, and an accent 
2 which is falling-rising. 
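
The interval sizes quoted above (thirds, fourths, fifths, sixths) can be checked against frequencies read off the plates. The following is only a rough sketch: it assumes equal-tempered semitones, and the function names and the small table of interval names are introduced here for illustration and are not part of the original study.

```python
import math

def interval_in_semitones(f_low: float, f_high: float) -> float:
    """Size of the interval between two frequencies in equal-tempered semitones."""
    if f_low <= 0 or f_high <= 0:
        raise ValueError("frequencies must be positive")
    return 12 * math.log2(f_high / f_low)

# Assumed mapping from rounded semitone counts to interval names.
INTERVAL_NAMES = {
    3: "minor third", 4: "major third", 5: "fourth",
    7: "fifth", 8: "minor sixth", 9: "major sixth", 12: "octave",
}

def nearest_interval_name(f_low: float, f_high: float) -> str:
    """Name of the nearest equal-tempered interval, or a semitone count."""
    semitones = round(interval_in_semitones(f_low, f_high))
    return INTERVAL_NAMES.get(semitones, f"{semitones} semitones")

# The speaker's main band and the lower band near the end of the passage,
# in vibrations per second as given in the text.
print(nearest_interval_name(150, 250))
print(nearest_interval_name(125, 200))
```

On this reckoning both bands come out as sixths (a major and a minor one respectively), which agrees with the description of the speaker's normal range as a musical sixth.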

But the curves as they appear on the spectrograms are not clearly
divided into such units. The movement is everywhere continuous, with an
up-and-down alternation, if we disregard the unvoiced intervals. It
appears that if one did not know (by auditory means) where the stresses
are located, it would not be possible to detect the characteristic word tones.
If we compare the tonal movement of spaserer på 1.98-135 with that of
hekseskudd 1.238-272, we find that the two first syllables of each have
almost identical appearance; for the moment we may disregard the high
tone on the third syllable of the latter, noting only that in both there is a
rise at the end. Yet we know that the first has accent 1 on the second
syllable, while the second has accent 2 on the first, with light stress on the
second. Similarly if we compare jeg går og spa- 1.55-98 with ganske
2.1-33 or i bilen 28.96 with bilene 21.40. Only if we locate the stresses,
does a difference appear between the two tones, and then only in relation
to the location of the stress. Wherever we have an accent 1, its stress lies
near the low point of the curve; in accent 2, the stress comes earlier and
usually includes the preceding high point, while the low point follows the
main stress.7
The up-and-down melody of East Norwegian may thus be regarded as
a kind of carrier wave for the accentual contours. When non-Norwegians
or speakers of other Norwegian dialects say that the East Norwegian
‘sings’, this billowing movement is what they hear. Conversely, when
East Norwegians say that the others sing, it is because they hear a melody
different from their own. In the words of Thomas Carlyle, ‘Accent is a
kind of chanting; all men have accent of their own, though they only
notice that of others’ (Alnæs, 1916). The melody is not in itself distinctive,
but acquires distinctive value when it is associated with stress in a par-
ticular way. The opinion that dynamic facts were more important than
musical in creating the distinction between accent 1 and 2 has already
been advanced by Selmer (1928). But he was not willing to carry his
reasoning further because of the impossibility of precisely measuring the
factor of stress. Although the nature of stress thus remains something of a
mystery, its auditory reality is unquestioned.

7. An interesting consequence of this is that when an uneducated East Norwegian
speaker shifts the stress backward from the second to the first syllable of a word like
spaserer, there is no change in melody, and we would be unable to see the difference in
a melodic chart like the present one. Cf. sjåførene 27.45; bilene 21.40 and bilen 28.96.

As soon as we have located our primary stresses, we will have no difficulty
in identifying most of them as associated with either accent 1 or 2, as
these have been described in the literature. The falling-rising melody of 2
is especially conspicuous, e.g. in hekseskudd 1.238, tenkte jeg 5.24, ...
sammen igjen 7.110. Accent 1 is less uniform; it rises in stod jeg ...
4.162, falls in fornemmelser 3.140, en 9.200, falls and rises in imot 10...,
været 11.20. The fact that accent 1 can fall instead of rising was noted
already by Selmer, but he drew no further conclusions concerning the
essential features of the melodic contrast (1920). This fall is calculated to
cast doubt on the traditional description of accent 1 as rising in
relation to 2, the falling-rising tone. But in order to arrive at a new con-
ception of the contrast, we have to clear away another common error
concerning the two accents, namely that they are ‘word tones’. It is true
that when words are spoken in isolation and are stressed on the first
syllable, they show two kinds of melodic contours, accent 1 most often
rising, 2 most often falling-rising. But if we consider words as they occur
in utterances, we find that there is no basis whatever for identifying them
with these melodic contours. If we eliminate the notion of ‘word’ entirely
from our description of the tones, we may then be able to isolate the real
contrast between them.
The idea that the accents are word tones arose from the fact that each
word, when pronounced as a whole utterance, has one or the other of the
two accents traditionally associated with its stressed syllable. But in con-
text the word may either have or not have this accent, depending on
whether it still contains a stressed syllable; and under certain circum-
stances, it may even acquire a different accent (changing from 1 to 2 or
vice versa).8 Furthermore, as we have shown above, the extent of the tonal
contour is quite independent of the number of words. The contours move
without break from stress to stress, so that together they constitute the
tonal movement of the entire utterance. We need only cast an eye on the
text before us to see evidence of this. In utterance 6 the five words ryggen
må være | gått av are divided into two tonal contours. Nearly everywhere
the contours cover anywhere from two to five words; in utterance 23 the
melody includes a whole sentence of four words.

8. For the rules see Alnæs (1925, pp. 30, 34).




The use of ‘word’ in this connection has troubled previous investigators.
Broch writes that ‘the idea of a “word” in relation to the two tonic accents
is to be taken in a wider sense than the usual grammatical one’ (1935,
p. 86). Elsewhere he says, ‘Der rhythmische Abschnitt hat somit in der
unbefangenen ungekünstelten Sprechweise gewissermassen die ...
des “Wortes” übergenommen’ (1939). The fact itself has been recognized,
and Alnæs has created the word tonelagsgruppe to describe a group of words
held together by a single tonal contour. ‘The word melody’, he writes,
‘can include more than a single word’ (1916, p. 92). This is equivalent to
saying that the word melody is not a word melody, and has led to an
unfortunate terminological situation.

The difficulty has arisen because previous investigators have not always
distinguished between the structural or word-differentiating function of the
accents and their contextual or syntactic function. In the lexicon, where
each word appears in a full form as distinct from all other words as its
pronunciation in isolation permits, the accent appears as a word tone.
But in context, the accent is a property of the utterance as a whole, and
its function must be seen in relation to the stresses which form the rhythmic
movement of the utterance. Here we cannot say that ‘the word tone is
extended to include many words’, but that each measure has its melodic
contour, and that the word has the melodic contour of a measure or even
of a whole utterance whenever it constitutes a measure or an utterance by
itself. In any case, the ‘word’ is a semantic-grammatical unit, and as such
irrelevant to our analysis of the tonal movement of the utterance; we do
not at this stage of our analysis have any way of determining what is a
‘word’. Instead of speaking of words, we shall therefore speak of measures,
and divide our text into measures, each one of which contains a stressed
syllable and includes a complete tonal contour. The beginning and end of
a contour will be considered as constituting a tonal juncture, usually
coinciding with a syntactic break.9

We now find that we have two kinds of measures, according to the
accents which characterize the stressed syllables. But is the entire contour
of these contrasting accents really relevant to the difference? If we compare
two final measures like flekken 4.160, with accent 1, and hekseskudd 1.235,
with accent 2, we see that the difference between them is localized in the
early part of the measure, preceding the low point. If we compare two
non-final measures like sto jeg 4.40, with accent 1, and ganske 2.0, with
accent 2, we find the same thing. In each case there is a rise from the low
point around d which appears to be independent of the preceding parts


9. The word ‘takt’ has been used by Olaf Broch (1935; 1939); cf. his statement
(p. 104) ‘... the basic norm may be defined as a tendency to produce stress-
waves of a certain size, i.e. length-units, or in musical terms measures. ...’


of the curve. A study of the measurements made by Selmer of words in 
isolation confirms the hypothesis that the melody which follows the low 
point is independent of the part that precedes. We can thus divide the 
contour of the measure into two parts, calling the first, which contains 
the tonal distinction, the (tonal) nucleus, and the second, which follows 
the low point, the (tonal) satellite. The nucleus is not necessarily identical 
with the stressed syllable, but it must include some or all of it. 
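
The division of a measure at its low point can be sketched in a few lines. This is a minimal illustration under stated assumptions: the contour is taken to be a list of (centisecond, vibrations per second) samples, the low point is counted as the last sample of the nucleus, and the sample values are invented for the example rather than read from the plates.

```python
from typing import List, Tuple

Sample = Tuple[int, float]  # (time in centiseconds, pitch in vibrations per second)

def split_measure(contour: List[Sample]) -> Tuple[List[Sample], List[Sample]]:
    """Split a measure at its lowest sample: nucleus up to and including
    the low point, satellite after it. On ties the first low is used."""
    if not contour:
        raise ValueError("empty contour")
    low_index = min(range(len(contour)), key=lambda i: contour[i][1])
    return contour[: low_index + 1], contour[low_index + 1:]

# Invented values shaped like an accent 2 measure: high, fall to a low, rise.
measure = [(0, 200.0), (10, 190.0), (20, 160.0), (30, 150.0), (40, 175.0), (50, 200.0)]
nucleus, satellite = split_measure(measure)
print(nucleus)    # the part carrying the accentual distinction
print(satellite)  # the part following the low point
```

Whether the low point itself is assigned to the nucleus or shared by both parts is a choice made here for convenience; the text only requires that the satellite follow the low.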

If we now compare the nuclei of all accent 1’s, we find that they have in
common nothing more than the presence of a relatively lower note some-
where in the stressed syllable. In contrast, the accent 2’s may have a
preceding high, which is regularly followed by a fall to a low point that
often comes in the following syllable. But, it may be objected, what about 
monosyllables like ut 27.210 or whole utterances like 23 with a steep rise 
from beginning to end ? Here, too, we propose to find a nucleus consisting 
of a low, followed by a rising satellite; the contrast with a corresponding 
accent 2 would only make itself felt in the opening part of the contour. 
Some have been tempted to regard monosyllabic utterances as lacking in 
distinctive tone because of the inability of these to take accent 2. But when 
we divide each measure into tonal nucleus and satellite, it is seen that 
monosyllables fall into the same pattern as other utterances with accent 1. 
But the greatest advantage of this distinction is that it makes room for
those instances where the contour of utterances is falling or level. This is
especially common with accent 1, as Selmer found when five of his thirty-
two words failed to rise at the end; but even in accent 2 there were two
which showed no rise.10 In none of these cases were the two accents con-
fused; their distinctive features were still present, as we are defining them
here. But Selmer assumed that he had secured ‘lexical pronunciations’ 
and that these were therefore pure examples of ‘word tone’. But every 
utterance, even of a word in isolation, must have a tonal satellite, and the 
examples measured by Selmer show that while the accentual difference is 
localized to the nucleus of the measure, the rest of the contour is dependent 
on other factors which we shall analyse later. 

We must now consider the formulation of our contrast between accent
1 and 2. If we consider only the point in the tonal curve at which stress
sets in (the beginning of the ballistic stroke), we could describe 1 as ‘low’,
2 as ‘high’. This was the solution of Carl Borgstrøm (1937) when he made
the only previous attempt to apply structural points of view to this prob-
lem.11 But in view of the fact that a low is just as essential to 2 as to 1, this
is not entirely satisfactory. Accent 2 is quite different from the usual
German or English hochton, and even in Norwegian it contrasts with an
expressive high tone, as we shall see. The most adequate musical descrip-
tion is one which characterizes 1 as low, 2 as falling, or high-low. The 2
always implies a preceding high and a following low, even when these are
not actually present.

10. Accent 1: N2 ..., N12 hus, N20 været, N22 søndag, N27 gårdsgutttjeneste;
accent 2: ... stuepikene, N59 selskapene. For a discussion between Selmer and the
author see Selmer (1954) and Haugen (1955).

11. A somewhat different interpretation was advanced in his article (1947). [See now
But we need to push this analysis a step further. By our definition accent
1 and 2 have a low in common; the low is thus not distinctive. But this
leaves us without any relevant feature in accent 1, contrasting with a
preceding high in accent 2; in Prague school terminology, one would then
have to say that in this contrast 1 is unmarked (merkmallos), 2 is marked
(merkmalhaft). The low is relevant to primary stress only, or in other
words: East Norwegian stress is normally accompanied by a low tone. This
has nothing to do with the presence of the two tonal accents; the same is
found, for example, in South German. Accent 1 is therefore the accent
which is accompanied only by the typical tonal quality of a stressed
syllable; for this reason Norwegians generally identify it with the stress
accents of other languages, even when these have the opposite kind of tone.
In accent 1 the melodic nucleus coincides with the stressed syllable. But in
accent 2 the melodic nucleus has a tendency to spread into the following,
unstressed syllable as well. We must turn back to our earlier consideration
of the melodic movement in East Norwegian. In both accents there is a
potential melodic curve of high-low-high; in accent 1 the stress falls on
the low, in 2 it falls between the first high and the low, so that the difference
between them is one of phase. It is often said that accent 2 is felt to be
incomplete at the end of the stressed syllable; this is because the melody
has not yet reached its low point. A new syllable is essential for its com-
pletion; such a syllable is heard by the ear in many cases where it is not
actually pronounced (e.g. 5.40, 7.50, 10.30 etc.).

But if this is true, we glimpse the possibility of defining the accents, not
so much in terms of tonal movement, as in terms of the extent of their
nuclei. Accent 1 is characterized as a short nucleus, concentrating the
relevant tonal movement within the stressed syllable, 2 as a long nucleus,
in which the tonal movement runs over into the next. The accents are often
called ‘monosyllabic’ and ‘dissyllabic’ because they are derived his-
torically from respectively mono- and polysyllables; but if our definition
is correct, this would also be an accurate synchronic description of them.

his paper 1962.] A popular book entitled Korrekt Dagligtale (1949) by Inger Bugge uses
the terms mørk ‘dark’ for accent 1 and lys ‘light’ for accent 2, obviously to express the
difference in initial tone (the writer has found that these terms are immediately under-
standable to speakers of Oslo Norwegian).

Since this definition is independent of the specific tonal movement of
East Norwegian, it should be possible to test it by trying it out on other
Norwegian dialects having relevant tonal accents. A study of the measure-
ments made by Selmer of the Norwegian spoken in Bergen, Sunnmøre and
Stavanger shows that this is not only possible, but provides for the first
time a common formula for these various melodic types. An impression
of the analysis that could be made will be given by placing side by side
the schematic tonal patterns which Selmer has drawn to sum up his results.
In order to make them comparable in terms of our formula, we have here
drawn dotted lines to show approximately the borders of the stressed
syllables (including only the core consisting of the long vowel or the short
vowel plus voiced consonant).
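
The definition by extent of nucleus lends itself to a simple test on measured contours. The sketch below is hypothetical: it assumes that the time of the low point and the borders of the stressed core (long vowel, or short vowel plus voiced consonant) are already known in centiseconds, and the function name and the handling of a low that precedes the core are choices made here, not part of the analysis itself.

```python
from typing import Optional

def classify_accent(low_time: int, core_start: int, core_end: int) -> Optional[int]:
    """Return 1 if the low point falls within the stressed core (short
    nucleus), 2 if it falls after the core (long nucleus), and None if it
    precedes the core, a case this sketch leaves undecided."""
    if core_start > core_end:
        raise ValueError("core_start must not exceed core_end")
    if core_start <= low_time <= core_end:
        return 1
    if low_time > core_end:
        return 2
    return None

# Invented timings in centiseconds, for illustration only.
print(classify_accent(low_time=12, core_start=5, core_end=20))   # low inside the core
print(classify_accent(low_time=28, core_start=5, core_end=20))   # low after the core
```

Because the test refers only to the position of the low relative to the stressed core, it does not depend on the absolute pitch of any one dialect, which is the property that makes the comparison across dialects possible.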

All three of the West Norwegian tonal patterns differ from the East
Norwegian in having a high tone within the nucleus of both accents. On
this point they stand closer to the other Germanic languages. But other-
wise there is a great contrast between North-west Norwegian (Bergen and
Sunnmøre) on the one hand, and South-west Norwegian (Stavanger) on
the other. In the North-west the nucleus includes a low as well, in both
accents, while in the South-west it does not. The two North-west dialects
have a steep fall in accent 1, a more leisurely one in accent 2 extending into
the following syllable. The common high-low melody of the nucleus is in
one case concentrated in the stressed syllable, in the other spread out into
the next. In the South-west the nucleus of accent 1 is high (the peak of a
movement low-high-low, in which the second low is contour). The nucleus
of accent 2 is high-low-high (with a following low which is contour); the
second peak may simply be regarded as a repetition of the first, whereby
the nucleus is marked as extending over into the second syllable. It should
be noted that the second high is not a new stress; in a compound like the
place name Ystervag the second syllable has the second high, while the
third has the secondary stress (Selmer, 1927, p. 53). The second high of the
nucleus gives rather an auditory effect of a carry-over of stress. There is a
parallelism with Swedish stress conditions which suggests that the South-
west Norwegian may reflect an older Scandinavian situation than the East
Norwegian.12


An interesting consequence of the views here advanced is that they make
it possible to draw parallels with the hypotheses concerning accentual
conditions in Danish recently advanced by the Danish phonetician Svend
Smith (1944; 1938). He has brought forward evidence to show that the
glottal catch found in many Danish words is not the main difference
between accent 1 and 2. Even when the glottal catch is missing, there is a
difference in the innervation of accent 1 and 2. In the former it is short and
intense, in the latter it is long and relatively weak. A comparison with a
Danish dialect which has preserved the tonal distinctions lost in most
Danish speech shows similar conditions to those of South-west Nor-
wegian. In the Felsted dialect, as investigated by Marie Bjerrum (1948,
p. 53), accent 1 has one high and is relatively short, while accent 2 has two
highs and is relatively long. It seems very probable that the contrast be-
tween a relatively short, dynamically (and therefore musically) intense
nucleus and a relatively long, but weak nucleus is the essence of the accen-
tual contrast throughout Scandinavia.

After this digression we shall now return to the text and consider some
examples of abnormal ... In utterances 14, 15 and 16 we have a series of
nuclei which ... The tonal curve is here practically level, with only ...
the usual melody. The speaker is shouting in 14 and ...; in both cases the
average level is about ... where is characteristic of unstressed syllables.
Alnæs ... this as ‘dvælende betoning’ (drawn-out tone), which occurs in
‘... speech’ (1916, p. 89; 1932, pp. 55, 68). He ... shouting, of exclamations
like fy! or skam! ‘shame’, ... dramatic declamation. It may even be used by
a man who is trying to make himself heard on the telephone: Det er Lund!
(1916, p. ...). In this kind of speech the distinction between accent 1 and 2
disappears; cf. ... 16.15 and mulig 16.50, sjåførene 14.0 and unna 15.50.13

12. Cf. the measurements of N. C. Stalling, ... He defines (p. 173) accent 1 as high
(with falling contour), accent 2 as ... high-low-high.

13. The ... line in utterances 16 and 24 is ... concerning the interpretation ...; the
two octaves ... of some writers on the subject; the effect is similar to falsetto.


Within the nucleus of a Norwegian measure we thus find three possible
tonal accompaniments. But we cannot set up an accent 3 to account for the
last of these. Its high tone is opposed, not to either one of the two accents,
but to the normal low tone of stress. The low tone is the normal
accompaniment of primary stress; the high tone is expressive.


Having determined the essential contrasts of the nucleus, we shall now
turn to the satellite. Some satellites are non-final in the utterance, others
are final. Since it is generally agreed that most of what is usually called
‘sentence intonation’ is concentrated at the end of the utterance, we may
expect that the study of the satellites will lead us to some conclusions
about sentence intonation. In this study, however, we shall not use this
expression, since it does not appear from our material that there is any
special sentence intonation, any more than there is a word intonation.
Like the word, the sentence is a grammatical unit which does not coincide
with any particular intonational contour, and can only be determined, if
at all, at a later stage of analysis.

If we study the satellites, e.g. in utterance 11, we find a series of non-
final contours which rise from the usual low of Aabel’s speech to the usual
high, about g. These reach a point which is also the beginning of the
following measure. While this is the usual non-final contour, it is rather
uncommon in final position. But it does occur, for example, in the
utterances 5, 10 and 17 which end with the words tenkte jeg, also in
utterances 22, 28 and 29. As will be seen, all of these involve the
expectation of a continuation. But this is not strictly necessary, since one
can easily change an expression like så tenkte jeg ‘then I thought’ into det
tenkte jeg ‘I thought so’, or even hva tenkte jeg ‘what did I think’, without
changing the tonal satellite. There is a neutral quality in this satellite,
common to non-final and final positions, which makes it usable for
incomplete statements, but also for questions or complete statements if
these are spoken in an unemphatic and unemotional way. Its function is to
fill out and complete the measure, and its final high note is in contrast
with the low note of the stressed nucleus. In dialects where the nucleus is
high, the ...




Да " 


these high finals is clearly one of animation; the high notes express the
speaker's interest in what he is saying. Lower notes in the same places
would have reduced the excitement of the narrative. Alnæs has pointed out
this function of the rising final contour; anyone who has heard East
Norwegians speaking, especially young girls, will have noticed the ‘...
Oslo tone’, with its cheerful, almost twittering quality (1916, pp. 107-8).
A comparison of the question in utterance 23 with the other high finals
shows that, as Alnæs has maintained, there is no special tone for questions.
But it is the usual thing for questions to be spoken with a high final, at
least when the speaker is interested in the answer. The effect of the extra
high final is one of appeal to the listener, and it is therefore usually
avoided in reading, especially in serious or impersonal material. The high
final, with a pitch only slightly raised over that of the normal, is the one
we find in most of Selmer’s recorded examples and in Storm’s and Alnæs’s
notation. Here it is not one of expressive appeal, but of ordinary finality
with sustained interest.

The interest is not always thus sustained to the end of the utterance. In
our material we have low final tone at the end of utterances 3, 8, 13, 15,
21 and 30. Of these, 15 must be eliminated at once as exemplifying the low
final after an exclamatory stress, discussed above.14 There is not much
difference in finality between these and most of the rising satellites. The
general rule of low tone for finality applies only in part to ordinary East
Norwegian speech, even though it is often taught in Norwegian schools
and shows its effects in a typical ‘reading tone’ which good teachers
attempt to discourage. This international, or at least west European
tradition, is in conflict with the movement of natural speech in Norwegian;
it may go back to a medieval practice.15 Yet low final is also a part of
Norwegian speech. Alnæs has formulated the rule that ‘if the emphatic
stress comes at the end of the sentence or word combination, the melody
ends rising; if the emphasis is placed at the beginning of the phrase or
sentence, the sentence melody is falling’ (1932, pp. 7, 56, 65, 75). He shows
that if two or more equal stresses are combined into one phrase, the last
receives the main stress unless there is some reason to stress an earlier one.
In smør og brød ‘bread and butter’ the second stress is stronger than the
first and gets rising tone, while in mine damer og herrer ‘ladies and gentle-
men’, the second is weaker (because of the formularistic nature of the
combination) and gets low final tone. This theory appears to be borne out
by our materials, since we find that in several of the utterances with low
final, the main stress comes early; e.g. in 3 the stress is on noen 3.45, in 8
on verste 8.90.

14. Alnæs makes the error of discussing this kind of falling final together with the
others, though they have clearly different functions.
15. Cf. the medieval melody used for teaching the proper intonation at commas
(small rise), questions (large rise), periods (fall), cited in Alnæs (1916, p. 152).
The contrast between high and low final thus seems to be associated
with the distribution of stress among the measures of the utterance. If we
assume that each utterance (or perhaps we should say ‘phrase’) has one
primary stress which is emphasized beyond the others, we may say that
the high final may mark the last stress as emphatic, while low final points
back to some previous stress as the emphatic one. Alnæs gives many
examples of utterances which must be read with emphasis on a preceding
stress to make sense, e.g. Øyvind het han ‘Øyvind was his name’ or han var
som en ungdom frisk ‘he was as chipper as a youth’. The utterance stress
has a function which points beyond the utterance to other utterances before
or after as well as unifying the structure of the utterance itself. It makes it
possible for the speaker to indicate what is new and important in his
statement. In utterance 1 our actor holds the whole statement together
and marks the final word as emphatic by the sharp rise at the end. In
utterance 2 there is less novelty and therefore normal tone (disregarding
the dragging tail). In utterance 3 he simply repeats in other words the
contents of 2, so that there is additional reason for allowing it to sink at the
end. In utterance 4 comes a new and surprising item of information, which
accordingly ends far up in the clouds. Whatever finality the low tone has
in Norwegian is due to this tendency to slacken interest near the end of a
predictable sequence. Its use in reading and lecturing is understandable in
view of the speaker's awareness of each approaching end. By de-empha-
sizing the end one also avoids putting too much of one’s own personality
into an impersonal statement.


We are now ready to generalize our results by expressing them in terms 
of the basic contrasts discovered. 


1. The utterance can be divided into one or more tonal measures, each
characterized by containing one primary stress, and the measures into a
tonal nucleus containing the contrast between accent 1 and 2 and a tonal
satellite which fills out the rest of the measure.

2. The nucleus is normally accompanied by a low tone, but may have
expressive high tone; this contrast is morphemic, since it conveys meaning
directly.

3. The low tone of the normal nucleus may come within the stressed core of
the syllable, resulting in accent 1, or shortly after it, resulting in accent 2;
this contrast between a short and a long nucleus may be regarded as
phonemic, since it distinguishes otherwise identical words and has no
semantic function of its own.




4. The normal satellite rises from a low point at the end of the nucleus to
a high that may be approximately the same in final or non-final position;
this high contrasts with the low of the nucleus, but has a neutral function
in relation to the statement as a whole.

5. The final pitch may be either higher or lower than that of the normal
satellite if some special emphasis within the utterance is to be expressed;
a high final lends emphasis to the last measure, a low final de-emphasizes
it, so that these can be said to have a syntactic function.


6. A high final may be augmented as a dramatic device to show the
speaker's interest in the statement and appeal to the listener's attention;
like the high tone in the nucleus, this may be regarded as an expressive
morphemic variant.

7. If it were desired to set up levels of Norwegian pitch similar to those
often used in describing American English, three would probably be
sufficient (low, high, extra high), though one might want to add a plus to the
extra high for the expressive morphemic variant and a minus to the low
in some cases of extra low finals; if one numbers them from below, accent
1 would be 1, 2 would be 2-1, a normal satellite would be 1-2, high final
1-3, low final 1-1.
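
The numbering proposed in point 7 can be written out as a small lookup table. The following Python sketch is only an illustration: the dictionary keys, the function names and the way a nucleus and a satellite are joined with a plus sign are assumptions made here, not notation from the study. Only the level sequences themselves (1, 2-1, 1-2, 1-3, 1-1) come from the text.

```python
# Pitch levels numbered from below, as suggested in point 7.
LEVELS = {"low": 1, "high": 2, "extra high": 3}

# Level sequences given in point 7. The keys are labels chosen for this
# sketch; they are not fixed terms of the analysis.
PATTERNS = {
    "accent 1": (1,),
    "accent 2": (2, 1),
    "normal satellite": (1, 2),
    "high final": (1, 3),
    "low final": (1, 1),
}


def notate(pattern_name):
    """Return the hyphenated level notation for one named pattern."""
    return "-".join(str(level) for level in PATTERNS[pattern_name])


def measure(accent, satellite):
    """Join a nucleus (accent) and a satellite into one tonal measure.

    The ' + ' separator is an assumption of this sketch.
    """
    return notate(accent) + " + " + notate(satellite)


print(notate("accent 2"))                  # 2-1
print(measure("accent 1", "high final"))   # 1 + 1-3
```

The plus and minus refinements mentioned in point 7 (an augmented extra high, an extra low final) are left out, since the text does not say how they would be written.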


The difference between (East) Norwegian and other Germanic languages 
should now be clear. Norwegian has a second (tonal) stress accent where
the non-Scandinavian languages (plus Icelandic, Faroese and Finnish 
Swedish) have only one. East Norwegian has low tone with stress where 
English and North German normally have high tone. The unstressed 
syllables of the satellite rise in East Norwegian more often than in the 
other languages, and the contrast of high-low final is used in a special way. 

Otto Jespersen, the Danish linguist, once suggested that the Norwegian
and Swedish ‘word melodies’ might make it more difficult to express the
nuances of thought and feeling than in other languages (1897-9, p. 606).
This opinion can hardly be maintained in view of the analysis made in
our study. It has been shown that the two accents are irrelevant to the
tonal contour of the utterance as a whole, constituting as they do together
the equivalent of the high stress tone of other Germanic languages.
Beyond this, Norwegian has means of tonal variation within the measure
and the utterance which correspond to those of other, related languages.
Jespersen's reaction is probably due to the non-native's difficulty in hearing
nuances in systems markedly different from his own. Though this has not
been the theme of the present study, further variation of expression can
of course be produced by changing the location of the stresses and altering
the tempo, so that the pauses will fall differently and thereby create new
utterance groupings. The size of the intervals between high and low can
also be altered for expressive purposes. Every linguistic structure possesses 
infinite possibilities of variation if its speakers have the need and the desire 
to make use of them. 


References 


ALNÆS, I. (1916), Norsk Saetningsmelodi, Oslo.
ALNÆS, I. (1925), Norsk Uttaleordbok, Oslo.
ALNÆS, I. (1932), De Levende Ord, Oslo.
BJERRUM, M. (1948), Felstedmaalets Tonale Akcenter, Aarhus.
BO, A. (1933), Tonegangen i Dansk Rigsmaal, Copenhagen.
BORGSTRØM, C. (1937), Norsk Tidsskrift for Sprogvidenskap, vol. 9, pp. 260-
BORGSTRØM, C. (1947), ‘De prosodiske elementer i norsk’, Festskrift Broch, Oslo,
pp. 41-8.
BORGSTRØM, C. (1962), ‘Tonemes and phrase intonation in South-Eastern standard
Norwegian’, Studia Linguistica, vol. 16, pp. 34-7.
BROCH, O. (1935), Transactions of the Philological Society, pp. 80-112.
BROCH, O. (1937), ‘Begriffsunterschied durch Intonationsunterschied in dem
Ostnorwegischen’, Mélanges Holger Pedersen, Acta Jutlandica, vol. 9, no. 1, pp. 308-22.
BROCH, O. (1939), ‘Numerusunterschied durch Intonationsunterschied in
Ostnorwegischen’, Travaux du Cercle Linguistique de Prague, vol. 8, pp. 116-29.
BROCH, O. (1944), ‘Tonelag bestemmende for lydutvikling’, Maal og Minne, pp. 145-67.
BUGGE, I. (1949), Korrekt Dagligtale, Oslo.
EKBLOM, R. (1933), Om de danska Accentarterna, Uppsala.
FINTOFT, K. (1970), Acoustical Analysis and Perception of Tonemes in some
Norwegian Dialects, Oslo.
HANSEN, A. (1943), Stødet i Dansk, Copenhagen.
HAUGEN, E. (1949), ‘Phoneme or prosodeme?’, Language, vol. 25, pp. 278-82.
HAUGEN, E. (1955), ‘Tonelagsanalyse’, Maal og Minne, pp. 70-80.
HAUGEN, E. (1963), ‘Pitch accent and tonemic juncture in Scandinavian’, Monatshefte,
vol. 55, pp. 157-61.
HAUGEN, E. (1965), Norwegian-English Dictionary, University of Wisconsin Press.
HAUGEN, E. (1967), ‘On the rules of Norwegian tonality’, Language, vol. 43, pp.
JESPERSEN, O. (1897-9), Fonetik, Copenhagen.
JOOS, M. (1948), Acoustic Phonetics, Language Monograph no. 23.
MEYER, E. A. (1937), Die Intonation im Schwedischen, Stockholm.
OFTEDAL, M. (1952), Norsk Tidsskrift for Sprogvidenskap, vol. 16, pp. 201-25.
PIKE, K. L. (1948), Tone Languages, University of Michigan Press.

[...] ‘Enkelt og dobbelt tonelag i Kristianiasprog’, Maal og [...]
SELMER, E. W. (1927), Den musikalske aksent i Stavangersmålet, Oslo.
SELMER, E. W. (1928), [...]
[...]osses’, A.Ph.Sc., vol. 12, pp. 33-9.
[...] Rigssprog, Copenhagen.
STALLING, N. C. (1935), Das phonologische System des Schwedischen I, Nijmegen.
STORM, J. (1860), Illustreret Nyhedsblad, no. [...]
STORM, J. (1884), ‘Norsk lydskrift’, Norvegia, vol. 1, pp. 40-56.
STORM, J. (1892), Englische Philologie, 2nd edn, vol. 1, Leipzig.




Part Eight 
Varieties of English 


Of all the influences that languages in contact wield upon one another,
those of the prosody are most subtle and resistant to attempts to
regulate them according to some standard. An intonation may persist
long after the other remnants of a language have vanished, as happened
with the Cacana language in South America. As English has expanded
around the globe, or as large groups of speakers of other languages have
formed enclaves in English-speaking territory, the varieties of English
that have resulted from the amalgam carry a residue of other accents –
the English of India has its characteristic intonations, as does that of
Hawaii and that of American Blacks. A sample of each of the latter
two is offered. The first, by Vanderslice and Pierson, has been slightly
expanded by the principal author from its original version. The second
is taken from Lorenzo Turner’s pioneering study of Gullah, a dialect
spoken on the Sea Islands of Georgia and South Carolina and the
mainland coast nearby. What distinguished Turner’s work was its break
from a kind of dialectology based largely on geography that American
linguists inherited from Europe, where geography has always been the
most powerful factor in separating one dialect from another. He showed
that Gullah, in its intonation and in other features, resembled certain
West African languages in ways that could not be put down to
coincidence. The significance of this – that the Gullah dialect is not merely
another variant of English as it was transported from England and
developed into a regional form of speech in America – for several years
escaped American dialectologists. But it has finally caught on, helped
by the vigorous concern with social groupings, particularly urban classes
and ethnic minorities and specifically Black English. That intonation
should be singled out by Turner bears witness to its persistence, its
tendency to live on when other features are submerged.


26 Ralph Vanderslice and Laura Shun Pierson


Prosodic Features of Hawaiian English


Ralph Vanderslice and Laura Shun Pierson, ‘Prosodic features of Hawaiian
English’, Quarterly Journal of Speech, vol. 53, no. 2, April 1967, pp. 156-66.


‘A hateful jargon’, ‘a lingo of lesser breeds’ (Carr, 1961), ‘an
unintelligible gibberish which passes for English’ (Lind, 1960), ‘a desecration of
the greatest language on earth, and an abomination in the sight of the
Lord’ (Honolulu Star Bulletin, 13 February 1962). These are some of the
epithets which have been hurled at Hawaii’s Pidgin English.
The Fiftieth State has had a unique linguistic history. Captain Cook’s
discovery in 1778 of the ‘Sandwich Islands’ with their indigenous
Polynesian culture; the arrival of New England missionaries in 1820 to
counteract the immoral influence of the whaling fleets and, incidentally, teach
English; the growth of the sugar industry after 1860, with massive
immigration of contract laborers from various parts of the world to work the
sugar and (after 1900) pineapple plantations; the overthrow of the monarchy
(with American connivance) in 1893, leading to US annexation in 1898
and ultimately to statehood in 1959; all have influenced the development
of the English dialect spoken in Hawaii and known locally as ‘Pidgin’.
Although an English-based pidgin – in the technical sense of a highly
simplified lingua franca used in a contact situation where it is native to
neither side (Hall, 1966, p. xii) – arose during the first century of the
Hawaiians’ contact with the outside world (while internecine warfare and
‘western’ diseases reduced their numbers from 300,000 to 44,000) it was
the 400,000 contract laborers imported between the 1860s and 1932 from
China, Portugal, Japan, Puerto Rico, Korea and the Philippines who
chiefly shaped the plantation pidgin from which the currently de-creolizing
dialect is descended. Pidgin is the usual term for this dialect in Hawaii and
will be so used here.
Today this dialect is socially embedded and fraught with connotations
of race, class and group loyalty. Scathing censure by schools and
newspapers has not discouraged use of Pidgin by the youth of the non-Caucasian
majority (Shun, 1961, pp. 1-9).
There has been a dearth of descriptive study amid the vigorous, but
unsuccessful efforts to ‘stamp out Island Dialect’. Pidgin has been treated
as ‘careless speech’ or ‘bad English’. Even supposedly scholarly studies
often turn out to be merely classifications of ‘most commonly encountered
errors’ (see, e.g. Kasdon and Smith, 1960).
The most neglected aspect of Pidgin has been its suprasegmental or
prosodic features, and it is the purpose of this paper to describe the salient
prosodic features of Hawaiian English. By prosodic features we mean
what Abercrombie calls features of voice dynamics; in particular rhythm,
pitch fluctuation (intonation), tessitura and register (1967, p. 89). By
salient features we mean those in which Pidgin contrasts with General
American English.¹ We use GAE to refer to the set of American English
dialects which share the features under discussion in contrast with
Hawaiian American English² (HAE), which we define as the English
spoken in the state of Hawaii by native or long-term residents whose
speech is marked by typical regional characteristics. The latter term thus
covers a dialect continuum from standard HAE to broad Pidgin; except
after such attributives we use Pidgin and HAE coterminously.
Hawaiian Pidgin differs conspicuously from GAE not only in features
of voice dynamics but also in segmental sounds, grammar and
vocabulary.³ Certain of these correlated attributes appear in the examples
below.


Rhythm, tessitura and register 

Isosyllabism. The rhythm of Pidgin is basically a syllable-timed rather than,
as in most dialects of English, a stress-timed one. Syllables tend to have
equal prominence in terms of loudness and duration, and to succeed each
other at regular intervals with an effect ‘like the steady tapping of a
typewriter’ (Linn, n.d., p. 7). The opposition between weak- and
strong-stressed syllables is largely leveled, especially that between content and
function words:


(1)
[pitch diagram]
He said it was personal, and he couldn't release it without a requisition.


This is a sample of Standard HAE which differs only phonetically from a
comparable GAE utterance:


(2)
He said it was personal, and he couldn't release it without a requisition.


1. The contrasts are in particular with the North Midland dialect of the senior author,
but we believe they hold for a rather wide range of dialects spoken on the mainland.

2. The redundancy of this term seems worth tolerating to forestall a misleading
contrast between Hawaiian and American. Hawaiian is used in its regional, not its
ethnic nor its linguistic sense; of course GAE is also widely spoken in Hawaii.

3. For a good (but dated) study of the syntactic and lexical peculiarities, see Reinecke
and Tokimasa (1934). See now also Reinecke (1969 – revised version of 1935 thesis).




The most noticeable prosodic feature of (1) is its isosyllabism. The terminal
pitch pattern is also characteristic of Pidgin and will be discussed under
Scoop.
It should be noted that the impressionistic term choppy as used in local
‘speech improvement’ training subsumes both this syllable-timed rhythm
and the frequent occurrence of intrusive glottal stops before syllable-initial
vowels in Pidgin.


Drawl. Although Pidgin syllables tend to have equal duration, ceteris
paribus, words of special semantic importance are often extended or
drawled to an extreme degree:


(3)
[pitch diagram]
Eh, you went go show yesterday? Was real good boy!⁴


The falling intonation on yesterday exemplifies the Pidgin pattern for
general questions discussed below. What is to be noted here is the
lengthening for emphasis of real and good, the latter lasting on the order
of a second. We may summarize the rhythm of HAE, then, as basically
syllable-timed but with marked drawling of occasional syllables for
emphasis.


Tessitura. The ‘characteristic range of notes, or compass, within which the
pitch fluctuation ... falls’ (Abercrombie, 1967, p. 99) is generally wider
in HAE than in GAE, and more frequent use is made of the higher pitches
within that tessitura. The wide tessitura tends to be interpreted as an
affective index by GAE speakers, to whom Pidgin therefore often sounds
markedly enthusiastic or excited.

Register. Registers are transitory ‘voice quality’ modifications arising
from changes in the complex of adjustments of the laryngeal structures
affecting phonation. These modifications are transitory in comparison
with the quasi-permanent features of an individual's voice [...],
but their time domain is usually long with respect to [...]
adjustments employed as segmental features, [...]
creaky voice or breathy voice are criterial features [...]
distinctions (see Ladefoged, 1964; Catford, 1964). Those
which play a significant role in HAE are raspy voice and falsetto.


4. The sentence means ‘Did you go to the show [...]’ [...] should be
read with the typically monophthongal [e] and [o] [...]
which form compound tenses with the unmarked infinitive. The vocative boy is here
used by one girl speaking to another; man, guys as vocatives are similarly ungendered
in HAE.


Raspy voice is technically a voiced ary-epiglottic trill. The vocal cords
vibrate in the usual way, and in addition the collar of the larynx constricts
in a sphincter-like closure and vibrates at a lower frequency. This
produces a rough quality which is apparently a permanent feature of voice
quality for some speakers – notably Louis Armstrong – but in Pidgin is
employed as a register which is brought into play for short periods,
especially on drawled syllables, as a sort of intensifier. Its use is more
common among, but not restricted to, male speakers.
Falsetto on the other hand is a register restricted to female speakers,
except for jocular use by males comparable to that in GAE. Many female
HAE speakers regularly produce their upper levels of pitch, within the
wide tessitura previously noted, with falsetto phonation. This use of falsetto
register is found in standard HAE as well as broad Pidgin, whereas raspy
[...]


Intonation 


Accent. Word stress is but loosely fixed in isosyllabic HAE and is
identifiable only by the occurrence of an accent or pitch obtrusion
(see Bolinger, 1958), which is more regularly at the end of intonation
clauses in Pidgin than in GAE; usually on the penult or ultima. Thus
[...] pronunciations as: hospital [has'pitol], operate [apə're:t],
[...]. This is particularly noticeable in compound [...]
[...]ing stress in GAE: snack bar [snek'ba:], crewcut [...],
summertime [...ma'taim], Volkswagen [voks'weegon].
Accent placement is not entirely predictable. Ordinary
adjective-noun phrases are often forestressed (even in quite noncontrastive
contexts) as if they were compound nouns: a pretty kitten [...].
There is, in fact, a certain randomness in the location of accent in HAE
but a strong tendency for it to occur at the ends of clauses. In any case,
accent does not perform an information-pointing function in
Pidgin as it does in GAE, where it is usually associated with the point of
least redundancy (Bolinger, 1958; Hultzén, 1959). In Pidgin the accent
location, usually clause-final, is independent of emphasis or contrast:


(4)
[pitch diagram]
But now, we will loCATE it for YOU, but ...


The speaker of (4), from the Honolulu Board of Water Supply, was
explaining a policy under which his department could no longer repair a
break in the pipe serving a private house, although they would still locate it.
In a comparable GAE statement, contrastive accent would be [...]:


(5)
[pitch diagram]
We'll still LOcate it for you, but ...


Another example of insensitivity of Pidgin accent-location to implied
contrast is a male student's reply (7) to his friend's contention (6):


(6)
[pitch diagram]
English is easy subject.

(7)
[pitch diagram]
Not in my CLASS boy

Even where there is explicit contrast within the immediate context, a
redundant reiterative element will usually be accented if it is clause-final
(speaking of land in Hawaii):


(8) (i)
[pitch diagram]
Forty-tree per cent is gavament OWNED

(ii)
[pitch diagram]
an' fifty-seven per cent is privately OWNED.


A shift of accent would be obligatory in GAE at least in (ii), not only
because privately contrasts with government, but also because owned has
already occurred in the context. It would be optional in (i) – i.e., the
contrast can be anticipated or not. Such an anticipatory accent-shift in a
Pidgin utterance is shown in (9i) on THIS semester but in (ii) the Pidgin
pattern reasserts itself to obliterate the expected (by GAE speakers)
parallel emphasis on nine and next:


(9) (i)
[pitch diagram]
I'm supposed to take eighteen credits THIS semesta

(ii)
[pitch diagram]
and nineteen credits next seMESta.


Thus at the same time that word stress in HAE is both less conspicuous
and less stable than in GAE, accent is less context-sensitive, being more
regularly at clause ends rather than correlated with information point
even in contrastive contexts.


Scoop. Pidgin statements usually (and special or interrogative-word
questions sometimes) take a rise-fall intonation which is very like the
corresponding contour of GAE except for the phonetic shape of the pitch
accent:


(10)
[pitch diagram]
That's why the mice died so young.

(11)
[pitch diagram]
It still has its problems financially.

(12)
[pitch diagram]
Oh my – goodne:::ss.

(13)
[pitch diagram]
Where you drop your quarter?


The accented syllable does not begin at the higher pitch as in GAE; [...]
or part of it, as well as the fall takes place during the [...].
For this phenomenon we borrow Hockett's [term]
scoop, extending it to include pitch rise after as well as on the accented
syllable (Pittenger, Hockett and Danehy, 1960, pp. 193-4). A closely similar
intonation is described by Jones as used sometimes in Southern England
and especially in Wales (1966, pp. 159, 161-2).


Alternate statement and special-question tunes. The rise-fall with scoop is
the usual pattern for statements and special questions in HAE. An
alternate statement intonation, reminiscent of that commonly associated
with Mexican Spanish, is sometimes heard:


[pitch diagram]
She steh. I forgot to take her home.


Interrogative-word questions (exclusive of reclamatory and echo
questions, etc.) have basically the same two patterns in both Pidgin and GAE.
One is identical with the rise-fall statement tune (sans scoop in GAE);
the other, which has been curiously neglected in descriptions of GAE,
is very common in HAE:


(16)
[pitch diagram]
Where can I get some cups?

(17)
[pitch diagram]
What room is Doctor Boyer?

(18) (i), (ii)
[pitch diagram]
To where you going Richard? To the wedding?


Note in (18) that the terminal pitch rise of the special question (i) is taken
over by the vocative, as it would be also in GAE.


General questions. The Pidgin pattern for yes-no questions is a very
conspicuous feature of the dialect, being markedly different from the GAE
pattern of rising or high pitch with rising terminal. The usual form of
HAE general questions starts at or quickly rises to high pitch level which
lasts until just before the accented ultima or penult, on which there is low
pitch with terminal steadying or slight rise:


(19)
[pitch diagram]
You bought milk?

(20)
[pitch diagram]
You folks going to the hootenany tonight?

(21)
[pitch diagram]
Punahou cannot speak slang too?

(22)
[pitch diagram]
You need a general catalog?

~ : | E 
Note that the accented syllable is the one after the [...]
downward pitch obtrusion is rare in GAE and [...]
difficulty hearing it as a question-marker. A variant with [...]
the ultima is sometimes used (the vocative here being of course [...]
clause):


(23)
[pitch diagram]
You going home now, Jimmy?


When the penult is accented, the pitch may fall in [...] as
wedding in (18ii), or both syllables may be on low pitch (24i):


(24) (i), (ii)
[pitch diagram]
Does this cut metal? Soft MEtal.


(Note that the reply (ii) to the question (i) exemplifies the absence of
contrastive accent shift in HAE.)

An intonation quite similar to that for general questions is used for
prepositive dependent clauses; it is especially noticeable when [...]
longer than the conversational norm, as in this example from [...]
debate:


(25)
[pitch diagram]
Until the gavament realizes that there is a problem [...]

Tag questions. A question tag is a special form which when appended to a
statement or command converts it into a question. The commonest
tag form repeats the subject (with obligatory pronominalization) and
verb (with obligatory reduction to auxiliary or do) of the original [...]
with verb inversion and negative-switching: He's going, isn't he? It [...],
doesn't it? They can't come, can they? There are several distinct intonation
choices, e.g. low-rising, high-falling, high-rising. Negative switching can
be deleted (You want to go, do you?), but with special implications [...]
syntactic and intonational constraints apparently subject to considerable
dialectal variation.
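
The GAE tag rule just described (pronominalize the subject, reduce the verb to an auxiliary or do-form, invert, and switch the negative) can be illustrated with a short Python sketch. It is a toy under stated assumptions and not a grammar of English: the word lists and the function name `make_tag` are hypothetical, the caller is assumed to supply the auxiliary or do-form already, and intonation is ignored altogether.

```python
# Hypothetical mini-lexicon for illustration only; none of it comes from the text.
PRONOUNS = {"john": "he", "mary": "she", "the boys": "they"}
NEGATIVE_OF = {"is": "isn't", "are": "aren't", "does": "doesn't",
               "do": "don't", "can": "can't"}
POSITIVE_OF = {neg: pos for pos, neg in NEGATIVE_OF.items()}


def make_tag(subject, auxiliary):
    """Build a GAE-style question tag from a subject and its auxiliary."""
    subj = subject.strip().lower()
    pronoun = PRONOUNS.get(subj, subj)      # obligatory pronominalization
    aux = auxiliary.strip().lower()
    if aux in NEGATIVE_OF:                  # positive statement, negative tag
        switched = NEGATIVE_OF[aux]
    elif aux in POSITIVE_OF:                # negative statement, positive tag
        switched = POSITIVE_OF[aux]
    else:
        raise ValueError(f"auxiliary not in the sketch lexicon: {auxiliary!r}")
    return f"{switched} {pronoun}?"         # inversion: verb before pronoun


print(make_tag("He", "is"))        # isn't he?
print(make_tag("They", "can't"))   # can they?
```

The variant without negative switching (You want to go, do you?) is not modelled, since the text notes that it carries special implications and constraints.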


Pidgin tags are of interest because of their frequency and their form.
The GAE type just discussed is seldom encountered; rather a special set
of questioning monosyllables is used. The commonest of these are
[...], no [no], eh [e], and huh [hà]. They usually have high pitch, with
terminal rise if sentence final:


(26)
[pitch diagram]
I tink dats where I been go see you down there, eh?

(27)
[pitch diagram]
Poho ink, no? (Hawaiian poho ‘waste’ – said of an exam)

These tags very often occur in sentence-non-final position, followed either
by the residue of inverted word order (28-30), or by more-or-less
redundant material, especially vocatives (31-33). In either case, what follows
the tag is at low pitch:

(28)
[pitch diagram]
Good, no, da kine?

(29)
[pitch diagram]
Tree-credit course, eh was?

(30)
[pitch diagram]
Hard, eh, Shakespeare?

(31)
[pitch diagram]
You la:zy buggah, eh you?

(32)
[pitch diagram]
You [...], eh Joyce?

(33)
[pitch diagram]
You didn’t knock over that can, huh, by my door?
Besides these monosyllabic tags there are two others which should be
noted. The question tag you know? has very wide currency among the
younger speakers of HAE, but its distribution and intonation do not
contrast with GAE usage except for a sharper pitch rise on the second
syllable (a correlate of wide tessitura). The use of the tag-like phrase or
what? on the other hand contrasts markedly. In GAE this is not a true
question tag, but a stock second (or last) element for turning a yes-no
question into an alternative one. The what normally has full stress and a
high-falling tune. In Pidgin this phrase is appended at low pitch and stress
and without pause:


(34)
[pitch diagram]
You been steh go or what?

(35)
[pitch diagram]
You like get licking or what?


Whether the material preceding the tag is independently a question is moot
in these citations (and typically) because of the absence of verb inversion
in Pidgin. Sometimes the tag or what seems to function merely as an
expletive comparable to GAE ‘or anything’:


(36)
[pitch diagram]
You never see him swimming or what.


Vocatives. Pidgin calling vocatives tend to be like one of the common
GAE patterns, with high pitch followed by a slight drop and terminal
rise:


(37)
[pitch diagram]
Mrs Maurer, will you call three two eight one?


The mid-rising and high-falling call patterns of GAE seem not to occur.

Conversational utterance-initial vocatives in Pidgin usually have a
falling intonation with very conspicuous scoop, as opposed to the rising
or fall-rise patterns typical of GAE:


(38)
[pitch diagram]
Alfre::d ...

(39)
[pitch diagram]
William ...

(40)
[pitch diagram]
Try che – Nora, try check this for me.


Parenthetic medial vocatives follow the GAE pattern, [...]
final ones are regularly at low pitch with falling contour. [...]
in GAE follow the pitch of high preceding material [...]
contrast between low-rising and low-falling vocatives (see Pittenger,
1957, p. 45) is absent.


Summary and outlook
Hawaiian American English is a unique dialect, of which the most salient
prosodic features are:


1. Syllable-timed rhythm, modified by emphatic drawl.
2. Wide tessitura.
3. Special registers: raspy voice, falsetto.
4. Scoop on the rise-fall statement (and special-question) tune.
5. Fluid word-stress and non-information-pointing accent placement.
6. Specific characteristic intonations: especially a general-question pattern
with sharp pitch drop contrasting with GAE rise.


None of these features is a serious barrier to mutual intelligibility with
other Englishes, although of course these features function as indices of
provenience – social and racial as well as geographical.

Dialect leveling proceeds apace in Hawaii as elsewhere under the impact
of television, talkies and travel. But ‘Haole talk’ (GAE) is not the sole
alternative to monodialectal broad Pidgin; nor, to many Island youth,
an acceptable one. Arthur Bronstein has wisely stated that in this country
one’s speech is considered standard ‘if it reflects the speech patterns of the
educated persons in your community’ (1960, p. 6). Standard Hawaiian
American English, regionally marked and distinct from GAE particularly
in the prosodic features herein described, is spoken by many educated
Islanders including community leaders, especially those of non-Caucasian
descent.

Every language, and every dialect of a language, is a structured system
which should be studied as such and which cannot be fruitfully regarded
as mere careless speech or as a haphazard amalgam of mistakes and
deviations from the norm. Hawaii’s Pidgin, as a dialect of American
English, is a particularly interesting case in point.


References

ABERCROMBIE, D. (1967), Elements of General Phonetics, University of Edinburgh
Press.
BOLINGER, D. L. (1958a), ‘A theory of pitch accent in English’, Word, vol. 14,
pp. 109-49.
BOLINGER, D. L. (1958b), ‘Stress and information’, American Speech, vol. 33,
pp. 5-20.
BRONSTEIN, A. J. (1960), The Pronunciation of American English, New York.
CARR, E. (1961), ‘Bilingual speakers in Hawaii today’, Social Process, vol. 25, p. 54.
CATFORD, J. C. (1964), ‘Phonation types: the classification of some laryngeal
components of speech production’, in D. Abercrombie et al. (eds.), In Honour
of Daniel Jones, Longman.
HALL, R. A. Jr. (1966), Pidgin and Creole Languages, Ithaca, N.Y.
HULTZÉN, L. S. (1959), ‘Information points in intonation’, Phonetica, vol. 4,
pp. 107-20.
JONES, D. (1966), The Pronunciation of English, Cambridge University Press,
4th edn.
KASDON, L. A., and SMITH, M. E. (1960), ‘Pidgin usage of some pre-school
children in Hawaii’, Social Process, vol. 24, pp. 63-72.
LADEFOGED, P. (1964), A Phonetic Study of West African Languages, Cambridge
University Press.
LIND, [...] (1960), ‘Communication, a problem of island youth’, Social Progress,
[...], p. 46.
LINN, J. (n.d.), ‘Speech improvement in Hawaii’, mimeo no. 5855, Department
of Speech, University of Hawaii.
PITTENGER, R. E. (1957), ‘Linguistic analysis of tone of voice in communication
of affect’, Psychiatric Research Reports, vol. 8, p. 45.
PITTENGER, R. E., HOCKETT, C. F., and DANEHY, J. J. (1960), The First Five
Minutes, Ithaca, N.Y.
REINECKE, J. E. (1969), Language and Dialect in Hawaii: A Sociolinguistic
History to 1935, Honolulu.
REINECKE, J. E., and TOKIMASA, A. (1934), ‘The English dialect of Hawaii’,
American Speech, vol. 9, pp. 48-58, 122-31.
SHUN, L. L. (1961), ‘A study of selected bilingual speakers of English in the
Hawaiian Islands’, unpublished thesis, University of Hawaii.


27 Lorenzo Turner


Gullah Intonation


Lorenzo Turner, ‘Gullah intonation’, Africanisms in the Gullah Dialect,
Chicago University Press, 1949.


Probably no characteristic of the Gullah Negro's speech appears so strange
to one who hears this dialect for the first time as its intonation. To
understand fully the intonation of Gullah one will have to turn to those West
African tone languages spoken by the slaves who were being brought to
South Carolina and Georgia continually until practically the beginning of
the Civil War. Among these tone languages are Mende, Vai, Twi, Fante,
Gã, Ewe, Yoruba, Ibo, Bini, Efik, and a few others. In the discussion that
will follow, an effort has been made merely to reveal some of the more
striking similarities between certain tonal patterns of Gullah and those of
a few of the West African tone languages.

So far as my own observation is concerned, features of tone in Gullah
are not used as primary phonemes, i.e. the tones of Gullah words do not
distinguish meanings as do tones in the African tone languages. There are
in Gullah, however, several intonation patterns, used in sentences, phrases
and words, that are quite common in the African languages but are not
used in cultivated English under similar conditions. These tonal patterns
will be grouped under eight headings.

The use of a high or mid tone at the end of a declarative sentence
In an English declarative sentence in which no implication or special
meaning is intended, the final syllable takes a falling tone if it is stressed
and a low tone if it is unstressed. In a similar Gullah declarative sentence,
however, the final syllable frequently takes a high or mid tone, and the
syllable may be stressed heavily, or weakly, or not at all.
de; tok юп» ћооз his 'kass deme ‘They talked about how he cursed them’
do, ‘god; ‘waka ‘It is God's work’

In many West African languages the final syllable of a declarative
sentence frequently takes a high or mid tone when no implication or
special meaning is intended:

Ewe: outen ko, тел Кроа ‘No, I saw only two’
деууйу lasvas ‘The child came’
&31523.., tits ‘He took the gua’



23 de; п,5бу ‘It is near’ 

Оз de, 72592 ‘It is forbidden’ 

mosroshiüi;kes3i, lozru3k92 miz ‘mosroshü;kes Ji, is my name’
то» 525 25 ‘I told it’

п%Зз маз піз 32mõzde3 ‘They came as children’

оз da:; ‘It is good’ 


On the final stressed syllable of an English declarative sentence, as already
indicated, only a falling tone would be used unless some special meaning
is intended. In Gullah, on the other hand, as in several West African
languages, the rising tone is common in this position:

Gullah: ы, jet, oma '501-3 ‘I tell them so’

dat; !flat; 'flDU2-3 ‘That’s flat flour’

‘mang ny app, n, Jeton Фә; 'waks foy !dema-3 ‘[The] man and
wife and children are working for them’


eskpo3 bo ‘He possesses money,’ lit. ‘He saw, received money’
8Е:1-3 ‘It is money’

Säz atis des ауђозга ‘He carries a tree on his shoulder’
езје з ‘It flies’

(езе; з ‘He passes’

aimi, Оза, 3 ‘I go’


The use of level tones — mid, high or low — throughout a statement 


Gullah: Yuz \ёоз des ons 'mits səma 'mans 'brakz ‘You go there and meet
some man broken’

'ols lesrts, yuz 'bestos гоз 'hom, роз 'sia ЬФ©зсоз jean, ‘Old 
lady, you'd better go home and see about your children’ 

‘dems "gal, 'kam, ‘homs; Je 'toka Оп»; bp: hiz 'kasy demz
‘Those girls came home; they talked about how he cursed them’

The occurrence of many level tones in words, phrases and sentences is
a common phenomenon in West African languages.


Osnyes азпуаз uskus ‘covetous person’
Ösgösnöszgöz ӧззіззіз ‘tall tree’
азаа акжа ‘Do not cry’
eskwuzze2na, 6,kwu, ‘Stop speaking’
leama; azja, ‘Let me be looking’
mu, јату та, mpin, йзазйау 240, kiet, ‘My friend, let us walk
together’


Yoruba: Бараз baba; mi; ‘my grandfather,’ lit. ‘the father of my father’
то» m; baba; rez ‘I knew your father’
kaisyes оз giis ‘May the world be straight’
Ewe: лиз las 15931593 ‘the carrying of the thing’
to3dosdos nus fisaslas ‘obedience to the teacher’
me, ga, yi, ‘I went again’
Vai: опу, #0:10:1800180110:180710101 ‘anything very large or ponderous’

The alternation of low and mid or low and high tones throughout a statement

Gullah: 'аеу mo; 'en; 'nanz 92 dem; ‘bing de; toks "оба рит, ‘There
haven't been any of them there to talk in a long time’
"Al, je, a "то "reus; Оз Тот, Ui 2 ı ‘Take care of my
house while I leave’
'demz da, !са› 9m, do, !И› от di, 'ррћ woh haw, 'manz m
‘werfy n, 'cilson da !wAka far пдет ‘They carry them and give
them to the people who have man and wife and children to work
for them’

Ibo: 23ko, 62061 ‘He planted coco yam’
єз bum, aykwaz па; азі ‘I have brought eggs and yam’

Efik: nasin, ison, ‘I am laying the floor’
eydiwak, 02wo, ‘a crowd of people’
Ewe: a,tiszo,tis ‘walking-stick’
esde, esme; ‘He took its inside out’
a,tis la, Коз ‘The tree is high’
Yoruba: йуаз miz пзјез Jagigosbi mia ‘My mother had the name of
Jan, гозђ miz’
nizgba,tis туда Sh des oih, туз Serres Kpuskpo, ‘When they
arrived there, they played a great deal’


The use of tones that fall from high to mid 


Gullah: pr; gonz dez, £D пута 'hims-2 ‘I went there to visit him’
i, ‘лпі tam3-2 ‘He burned them’
ЛАЛЗ-2 ‘onion’

Efik: ayma3-2 02912 pe2kop2 ‘If it is good, I will consider it’

Ibo: ejbg:s-2 Ка 2 de koi ‘Where are you going?’

Yoruba: o,f da:3-2 ‘He is good’
оз Ба:з-2 laisyer ‘You met her alive?’




The use of tones that rise from low or mid to high or from low to mid


It might occur at the end of an unfinished tonal group, for example, at the
end of a subordinate clause that does not end the sentence; but it does
not occur under such conditions as obtain in the following Gullah sentences:


ejes азкај > ћђуађ; ‘He goes to the farm’
аутћ ska. ‘I go’
Ла, зиз ‘my tree’

аши gbo3gbo;-3 ‘unripe lime’
Yoruba: moamõ:1-2 ‘I knew her’

Oro. kpusopo, Är Кйз a2gb3, ‘Many words do not fill a
basket’


RE husband’

okras ‘okra’

be bt, ‘baby’

bant? Ama-2 ‘Burn them’
æla; ‘ground’

mnes ‘mother’

dos 0sbi, ‘peace of heart’
шаң ‘grave’

uadi, ‘a town in Nigeria’

e1 fez ‘shed’
esfe, ‘which?’
Озаиздиз ‘hole’

2kras ‘soul’

Ка; ‘to scatter’

Каз ‘to touch’
y s ‘male’
са ‘small birds’




"The use of a level tone at the end ofa question ` Lat 


In English at the end of a question when по special meaning is implied, 
rising tone is the usual one if yes or ло is required for an answer, and a 
falling tone if it is not. In Gullah, on the other hand, a level tone is quite 
common at the end of a question whether or not yes or ло is required [ог 


an answer. 


Gullah: wots 'demz-ı Ф: 35 yu; ‘What do they give you?’
‘yu; поз WDts dens !pes је; ‘bins? ‘Do you know what they pay
for beans?’
'enatis'r €2blz toim, kamzin2 ак ‘Isn't slavery coming back?’

In the West African languages a level tone is frequently heard at the
conclusion of both types of questions.


Efik: титоздаг езпуез 92kpo2b01? ‘Do you think he would have taken
it?’
nysisdis пзіакз mamo; imis, kay haz екопг? ‘Why didn't they
go to war?’

Ibo: Кєүйоз ӧзіиз ӧз Sly tiz ге»? ‘How did he hit you?’

Yoruba: tazlog Кэ» Киз, ћуаз г52 tabis Баћа гє? ‘Who was it that died
first, your mother or your father?’
Кре, из tazni2? ‘With whom?’
оз пасђез kptilus res? ‘He is living with you?’

‘Are you coming?’

maskpes des nuwosa,? ‘Am I to help you?’


Thanks go to the authors who have permitted their works to be
[...] and cooperated in the alterations necessary to make them
[...] less to the two whose articles appear here for the first time.

The editor is grateful also to those who made suggestions about
what to include: Isamu Abe, David Crystal, Fred W. Householder Jr,
and William S.-Y. Wang.


Permission to reproduce the following readings in this volume is acknowledged:


2 MD rime Ltd 
3 University of Michigan Press


1 Didier. (Canada) Ltd 
5 United States Office of Education


13 Longman Group Ltd
14 Ethnomusicology
15 General Gramophone Publications Ltd
Zeitschrift für Phonetik, Sprachwissenschaft und Kommunikationsforschung
ciety of America
18 Pacific Linguistics
Journal of the Acoustical Society of America
Journal of the Acoustical Society of America
4 Phonetica, S. Karger, Basel
Acta Philologica Scandinavica
Quarterly Journal of Speech
and Lorenzo Turner


Author Index 


Abe, I., 314, 343 

Abercrombie, D., 391, 440, 441 

Abramson, A. S., 247, 349 

Adamec, P., 220 

Algeo, J., 50 

Allen, W. S., 88 

Alnæs, J., 415, 425, 426, 427, 431, 433, 
434 

Arisaka, H., 339 

Armstrong, H., 180, 359, 360

Arnot, W., 175 

Arnold, G. F., 178, 179, 180, 181, 182, 
183, 184, 186, 187, 188, 189, 190, 191 


Bally, C., 339 

Barney, H. L., 368, 369 

Beneš, E., 220 

Berman, A., 87 

Bierwisch, M., 87, 96, 97-8, 107 

Bjerrum, M., 415, 431 

Blom, E., 282, 284 

Bloomfield, L., 194-5 

Bo, A., 415 

Bolinger, D. L., 39, 50, 51, 87, 90, 92, 
97, 186, 198, 200, 205, 227, 294, 339, 
346, 348, 442

Borgstrom, G., 415, 428

Borst, J. M., 350

Bowen, J. D., 90, 91, 92 

Brecht, B., 179, 180, 185 

Bresnan, J. W., 87, 93-6, 98-102, 103 

Bricker, P. D., 248 

Broch, O., 415, 427 

Bronstein, A. J., 348, 449 

Bugge, I., 429 

Buning, J. E. J., 228

Burgstahler, P., 35 

Bush, H. C., 345 


Carr, E., 439 
Catford, J. C., 176, 441 


Chang, N.-C. T., 366 

Chao, Y. R., 217, 269, 338, 392, 393 

Chapallaz, M., 315 

Chomsky, N., 50, 87, 90, 91, 92, 93, 94,
95, 96, 107, 230

Classe, A., 77

Collinson, W. E., 175, 180

Cooke, D., 304

Cooper, F. S., 350

Costeriu, E., 218

Coustenoble, H. N., 80

Crystal, D., 49, 51, 110, 130, 225


Danehy, J. J., 444 
Daneš, F., 157, 217, 218 
David, E. E., 248 
Davis, H., 357 

Davy, D., 120
Delattre, P., 26, 155-6
Denes, P., 248, 348
Deva, B. C., 304

Downing, B. T., 87, 103-7
Dudley, H., 350
Dürrenmatt, F., 179, 180


Ebeling, C. L., 228
Ekblom, R., 415 

Eliot, G., 189 

Emonds, J. E., 103 
Essen, O. von, 228, 286 


Fairbanks, G., 370

Fintoft, K., 414, 415

Firbas, J., 220, 222, 226

Fodor, J. A., 87

Fonagy, I., 262, 287, 288, 292
Frisch, M., 179, 181, 185, 189


Galsworthy, J., 189
Gardiner, A. H., 341
Garding, E., 349
Gerstman, L. J., 90
Giet, F., 385

Gleason, H. A., 91, 212

Green, H. C., 36 

Greenberg, J. H., 216, 218, 220 
Gunter, R., 156, 194, 200 
Güttinger, F., 178 


Haas, M., 264, 273 
Hadding, K., 315, 415 


Hadding-Koch, K., 348, 349


Hall, R. A., 261 

Halle, M., 50, 87, 90, 91, 92, 94, 95, 96, 
107 

Halliday, M. A. K., 176, 181, 227, 230 

Hansen, A., 415 

Hatcher, A. G., 220 

Hattori, S., 338 

Haugen, E., 336, 414, 416 

Hausenblas, K., 217 

Hermann, E., 348 

Hill, A. A., 87, 91

Hockett, C. F., 83, 84, 91, 92, 94, 95,
217, 222, 226, 274, 444

House, A. S., 370

Householder, F. W., 92, 271, 273

Hultzén, L. S., 176, 442


dis, B. A., 222


Jakobson, R., 216, 218

James, E. E., 40

Jassem, V., 179, 180, 181, 187, 188,
191, 227

Jensen, M. K., 415 

Jespersen, O., 220, 435 

Jimbo, K., 338 

Jones, D., 73, 227, 283, 444 

Joos, M., 36, 366, 416 

Jouve, P. J., 305 


Kasdon, L. A., 440 
Katz, J. J., 87 
Kawakami, I. S., 337 
Kersten, L. G., 248 




Kingdon, R., 89, 90, 98, 124, 178, 180, 
181, 183, 185, 186, 188, 210 

Klima, E. S., 196, 230 

Kopp, G. A., 36 

Kruselnickaja, K. G., 222 

Kurath, H., 341 


Ladefoged, P., 90, 91, 92, 107, 441 
Lakoff, G., 87, 88, 98, 99, 102 
Lapteva, O. A., 220 

Larsen, R. S., 313-14 

Lee, W. R., 184, 227, 228 

Lees, R. B., 196 

Lehiste, I., 365, 367, 372 

Léon, P. R., 17 

Levis, J. H., 271 

Liberman, A., 253 

Lieberman, P., 90, 197, 201, 234, 236 
Lind, A., 439 

Lindau, M., 88 

Linn, J., 440 

List, G., 261 

Lord, C., 99 

Lukoff, F., 95


McIntosh, A., 227, 230 
Magdics, K., 262, 288 
Malmberg, B., 415 
Martin, P., 17 

Mason, D. G., 282, 283 
Mathesius, V., 217, 218, 222 
Mattheson, J., 304
Meyer, E. A., 45 
Meyer-Eppler, W., 365 
Michaels, S. B., 234 
Milton-Williams, J., 348 
Miyata, K., 337 

Моп те, H., 183 


Newman, E., 282
Newman, S., 93, 101 
Novak, P., 220 


O'Connor, J. D., 178, 179, 180, 181, 
182, 183, 184, 186, 187, 188, 189, 
190, 191 


Oftedal, M., 415
Ohman, S., 415 
Osgood, G. C. E., 250, 253, 254 


Pala, K., 220 

Palmer, H. E., 179, 180, 181, 185, 186, 
283 

Panconcelli-Calzia, G., 385

Pence, A., 314

Pestalozzi-Schiirli, A., 175, 191 

Peterson, G. E., 365, 367, 368, 369, 372 

Piersen, L. S., 437 

Pike, E. V., 313-14 

Pike, K. L., 49-50, 60, 84, 90, 93, 176, 
180, 181, 198, 314, 325, 344, 348, 
367, 387, 415

Pittenger, R. E., 444, 449

Pope, E., 87, 96-7, 98, 103 

Porte, J. F., 282 

Potter, R. K., 36 

Pyles, T., 50 


Quirk, R., 110, 120, 130, 225 


Raspopov, E. P., 222 
Reed, W. H., 282, 284 
Reinecke, J. E., 440 


Schneider, W., 191 

Schooneveld, C. H., 228 

Schubiger, M., 150, 156, 175, 176, 181,
188, 190, 228

Selmer, E. W., 415, 426, 428, 430, 433

Shaw, C. B., 181

Shun, L. L., 43

Silva-Fuenzalida, I., 90, 91

Sledd, J., 83, 85, 198, 207, 210


197-8, 199, 202-3, 210, 367
Smith, M. E., 440
Smith, S., 415, 431
Stalling, N. C., 415, 431
Stevens, S. S., 357

Stockwell, R. P., 49, 50, 87, 90, 91, 92, 

93, 107, 198, 207, 230 
Storm, J., 415, 425, 433 
Straka, G., 35 
Strevens, P., 227, 230 
Suci, G. J., 250, 254 
Sweet, H., 344 
Szamosi, M., 87 


Taguchi, R., 340
Tannenbaum, P., 250, 254
Terakawa, K., 339
Tokieda, M., 340
Tokimasa, A., 440
Trager, G. L., 49, 50, 83, 84, 90, 91, 9
156, 176, 197-8, 199, 202-3, 210, 367
Trojan, F., 292, 293, 294, 296, 297, 299,
301, 302
Turner, L., 437


Uhlifova, L., 220
Ujfalussy, J., 307
Uldall, E. T., 234, 251, 344, 348-9, 391


Vachek, J., 230
Vanderslice, R., 90, 91, 92, 93, 437
Vanvik, A., 415


207, 209, 212, 213, 337, 442, 444 


2120 (in Norwegian), 415, 425-32, 
Accent nucleus (in Japanese), 338
Accent language(s), 14
Accusation, 345
Allemanic, 185
Allocontour, 238
Allophone, 84
Allotail, 119
Allotone, 392, 399, 402
Ambiguity, 1 139, 159, 166, 196, 197,
213, 214
American English see English
Amplitude, 31-2, 235, 237, 238-42,
247, 248
Anaphora, 88, 96, 99, 196
A d x 60, 286, 291, 301-2, 309, 


Ai rig 289, 298-9, 308 
Р on 338, 345, 433 
Annoyance (vexation), 404, 410, 411,


137, 138, 139, 142, 145-53, 198, 199,


Accent 1 (in Norwegian), 415, 425-32,
434-5


Basic pattern 1 (in Italian), 359, 361, 
362 

Basic pattern 2 (in Italian), 360, 361, 
362 

Beethoven, 304, 306 

Berg, A., 305, 308 

Bergen, 415 

Bini, 451 

Boredom, 115, 119 

Boundary question, 89, 103-7 

Brahms, 282, 306 

Breathy voice, 288, 289, 291, 292, 294, 
301, 411, 441 

Britain, 51, 261 

British English see English 

Britten, B., 282 

Bruckner, 282, 285 


Cacana, 437 ` 

California University, 392 

Chant, 74, 265, 266, 269—70, 279 

Chest tone, 289, 291, 299, 302 

China, 269, 270, 271, 385, 439 

Chinese, 13, 60, 222, 269, 313, 338-9, 
366, 385, 391-413 

Chengtu (Szechuan), 366, 
391-413 
Mandarin, 393 

Chomsky adjunction, 105-6 

Clitic, 89, 97, 98 

Commands, 139-41, 143-4, 149, 
156, 166, 171—4, 175, 214, 253, 251, 
342-3 

Comment (Rheme), 96-7, 217, 222-7, 
229 

Complaint, 286, 289-90, 299, 308-9, 
311 

Configurations, 17, 49 

Constitution, 221, 222, 224, 225, 226, 
229 

Contempt, 323, 405, 410-13 


Content, 55, 144-9, 202-6, 212-15, 
221,251 
Continuations, major and minor, 26, 
155, 167-9, 171-4 
Contours, 14, 28, 49, 53-82, 87-103, 
107, 138, 156, 159-61, 166-71, 174, 
176, 198-215, 225-9, 250-58, 
314-15, 319-23, 325-36, 344, 348-57, 
365, 367-83, 415, 418, 425, 426, 428, 
435, 444, 449 
neutral, 87-8, 89, 257-8 
nuclear, 331-5 
primary, 50, 84-6, 142, 144, 147, 149 
total, 65-8, 75-8 
Contour centre, 88-92, 95-9, 100-103 
Contour point, 61, 62, 63, 319-23 
Contour tone see Tones 
Contrastive 
accent, 442-3, 446 
emphasis, 227-8 
stress, 88, 96-7, 99 
Cook, Captain, 439 
Coquetry, 286, 288, 295-6, 306-7 
Cordiality, 342, 345 
Cornell university, 83 
Creaky voice, 302, 393, 401 
Curiosity, 345-6, 361 
Cyclical rules, 95-6, 95 
Czech, 220, 221, 226-30 


Danish, 431 \ 
Felsted, 431 
Debussy, 285, 305-10 
Declarative sentences, 451 
Denmark, 415 
Dialects, 262, 283, 391, 392, 430, 431 
see also English, Chinese and 
Norwegian 
Discourse, 156, 167 
Dismissal of topic, 407, 410, 412-13 
Dominant order, 216 
Dravidian, 304 
Drawl, 441 
Duration, 20-23, 25, 31-2, 34, 315, 
348, 440 ! 
Dutch, 345 


Ewe, 451-5
Exclamations, 156, 167, 171-4, 34,
431, 433


Fante, 451

Faroese, 433

Fauré, 282, 285

Fear, 286, 289, 297-8, 307-8

Folk song, 271, 310-11

Formants, 11, 237, 385-90
France, 282

French, 23, 27, 155, 156, 157-74, 175,
180, 185-6, 188, 262, 291-310, 313, 9

250, 391.


Efik, 451-5 

Elgar, 282-5 

Emic content, 326-7 

Emotion, 11, 15, 20, 24-5, 29, 57, 137,
199, 200, 215, 233, 235, 240-44,
241-8, 250-58, 261-2, 286-311,
320-21, 339-41, 401, 409-12

‘Emotional space’, 233

Emphasis, 65, 66, 97, 227-8, 269, 314,
343, 363-4, 403, 409, 412, 434-5

Emphatic approval, 404, 410,
413
Enclitics, 340


England, 282 
English, 11-14, 17, 20-29, 49-51,
61-82, 83-6, 87-107, 110-35, 137-53,
155-7, 168, 175-92, 194-215, 217,
218, 220, 221, 222, 224, 226, 227,
228-30, 261-2, 283-5, 291-304, 313,
315, 325, 327, 340, 341-6, 348-57,
359, 362, 365, 367-83, 429, 437,
14645125 ANE КК ИШ
American, 27, 83-6, 212, 251, 254,
258, 261, 262, 283, 325, 327, 346,
348-57, 367-83
British, 86, 150, 212, 261, 262, 283-5,
359
Hawaiian, 437, 439-49
Gullah, 437, 451-5
Indian, 437


Falsetto, 60, 334, 413, 432, 441-2


345,3 


Fundamental frequency, 11, 12, 13, 15,
32, 33, 34, 37, 40-47, 233, 235-48,
348-57, 365, 367-83


German, 27, 156, 175-92, 220, 226,
221, 229, 262, 291-310, 345, 365,


ў деше, 12,203, 204 
Global constraint, 98, 101, 102 


Handel, 304 

Harmonics, 31, 36-8, 39, 40, 41, 44, 47,
368, 377

Haskins laboratories, 250, 253, 350, 357

Hawaii, 440

Hawaiian English see English

Head (of tone group), 112, 120-25, 175

Head tone, 288, 293, 296, 302

Hesitation, 72, 344 

High booster, 1 

Huastec, 313-14, 317-24 
Hungarian, 262, 286-94, 299, 301-3, 

.. 306-7, 310-11 

Implication, 169

Indian English see English
Indiana university, 264, 266, 271
Inflection (of the voice), 283
Inherent stress, 125,

SEA ‚ 125 see also Innate 


6-7, 185-8, 191 see also 


› 34, 36, 315, 348, 38 
see also Loudness
Interrogation, 167, 172-4, 390 see also
Question


Intoneme see Phoneme, Intonation 
Inversion, 218, 220, 223-5 

Irony, 119, 338, 345

Italian, 308, 315, 359-64 

Italy, 282 


Japan, 338, 439 

Japanese, 223, 314, 337-46 

Joy, 286-7, 292, 304 

Juncture, 112, 415 see also Phoneme, 
Juncture 


Kodaly, 308, 311 

Korea, 438 

Korean, 222 

Kymograph, 17, 30-35, 391-2, 415, 
416 


Length, 49, 112 see also Duration 

Lexical meaning, 54-7, 60, 196 

Lexical stress, 63-4, 72, 75, 80 

Liveliness, 363 

Logical structure, 101-2 

Longing, 286, 287-8, 294-5, 306, 
310-11 

Loudness, 20-21, 25, 110, 152, 287-93, 
296-7, 440 see also Intensity 


Mahler, 282, 285 
Margin, 325 
Markedness, 176, 218-23, 225
Massachusetts Institute of 
Technology, 93, 98, 100 
Maya (Yucatan), 324 
Mazatec, 13 
‘Meaning question’, 90 
Measure, 427-8, 432, 434-5 
Melodic analyser, 17, 39-47 
Mende, 451
Minimal pairs, 161, 166, 168, 171 
Mixtec, 60 
Monteverdi, 305, 306, 308 
Mora, 338
Morpheme, 60, 68, 320 
intonation, 320 
pitch, 367 


H 


Morphological stress, 92 

Moussorgsky, 308-10 

Mozart, 305-10 

Music, 11, 15, 261-2, 263-80, 282-5, 
286-311, 365 

Musical intervals, 377, 378, 382, 383, 
418-25 


Navaho, 60
Neutrality, 176, 218-23, 225 
New Guinea, 314, 325, 335 
Noise (spectral), 385, 387, 396 
Norwegian, 343, 366, 414-36 
east, 343, 366, 414-36 
north-west, 430 
south-west, 430-32 
| west, 430 
Nucleus, 51, 111-31, 175-91, 226, 325, 
415, 428-35 
Nuclear stress, 92, 331 
Nuclear stress rule, 94-7, 102-3, 107 
‘Nuclear stress question’, 39, 93-103 
Nuclear tones 
` simple, 113-16 
complex, 116-17 
compound, 117-18 


Onset, 111, 120, 122, 125 
Oscilloscope, 35-5, 237 
Oscillograph, 42, 45-6 

Overtones, 11, 19 see also Harmonics 
Oto, 54


Papua, 325 
Paralanguage, 176 
Parataxis, 78-9 
Parenthesis, 25-6, 137-8, 118 361 
Particles, 175-92, 340-42, 3 
Pause, 41, 68-72, 76, 78, 79, S 95,97, 
98, 112, 129 
final, 68-71 
terminal, 68-71 
Perturbation (tonal), 392; 
Philippines, 439


У ~ 
Phoneme, 11, 15, 31/615 84: Aan" E Se e а 


i 171, 174, 197255. ‚8318-24, v 


ele VE AAR e 
juncture, 83,209,215 _ : 
length, 263, 317-18, 319, 324 _ 
pitch, 61, 84-5, 160, 197, 199, 201, 

206, 215, 318-24 
segmental, 320 
terminal, 83-6, 137, 171-2, 197, 198, 
201, 206, 367 

Pidgin see English, Hawaiian 
Pitch accent see Accent
Pitch see Fundamental frequency
Pitch levels, 14, 17, 49, 111, 120-2
131-57, 166-9, 171, 251, 257-8,
401, 408-12, 435 see also Phoneme,
pitch
Pitch range, 110, 113-14, 116-36, 137,
251, 258, 272-4, 283-4, 286-311,
363, 401, 408-12, 418, 425, 441
Pleading, 345
Polite request, 314
Potential contour point, 318
Povo synthesiser, 235, 237, 238
Prague school, 157, 216, 429
Precontour, 67-8, 72, 
201, 206, 215, 319, 
349, 379, 380, 382, 
Pre-head, 112, 125-6, 131, 
Pre-nucleus, 175-6, p 182,1 
Sea séit", 
Protest, 0413 _ 


Questions, 11; 14, 423.8, 
140-43, 144, 1, 
188, 189,212, 


171-3, 175, 177, 1 
231,253, 257-8, 283, 297, 314-15, 

320 22, 342, 344, 346,348-57, 
359-61, 403, 408-9 412- 13, 432, d 
KEE EE S 
Ee 212, aS 283,29 


36060 


Raspy voice, 441-2 ` 
Ravel, 285 ae 


7 'essive OF! дет, 
EE En. паз ^. 


see 265- 


Relevance, 198, 215 

“Representation question’, 89-93 

Requests, 177, 342-3

Responses, 194—9, 201-4, 206, 213-15 

Retorts, 177 

Rhythm, 22-3, 49, 68, 72-4, 82, 440-42 

Rhythm units, 67, 71-4, 76, 78, 82 

Rhythmicality, 110 

Root sentences, 103—4, 106 

Rubbra, 282 

Russian, 175, 216, 218-21, 226-9, 340, 
342, 345 


Sarcasm, 115, 119, 286, 291, 302-4, 
309-10 

Satellite, 415, 428, 432-5 

Schumann, 304, 306 

Scoop, 441, 444-5, 448 

Scorn, 286, 290-91, 299-301, 309 

Semantic differential, 233, 250-51 

Shallow structure, 101

Sister adjunction, 105-6 

Sing-song, 269-70 

Spanish, 23, 27, 60, 74, 314, 345, 444 

Mexican, 444 

Spectrograph, 34, 35-40, 41, 44, 46-7, 
247-8, 271-4, 348-9, 368, 379, 
385-90, 391, 392, 400, 414, 416, 431 

Statement, 49, 58-9, 140, 166, 175, 177, 
178, 182, 184, 186, 187, 253, 257, 314, 
321, 341, 343, 346, 348-57, 402, 
408-10, 412, 444 

Stress, 13-14, 17, 20-24, 49-50, 63, 84, 
91-2, 110, 118, 131, 178, 180, 182, 
183-4, 186, 226-7, 286-98, 301, 
317-18, 348, 359-64, 368, 376, 383, 
399, 401, 402, 412, 414, 416, 425-7, 
429-30, 432-5, 440, 451 

Surface structure, 101 

Summer Institute of Linguistics, 49, 60, 
314, 325 

Surprise, 286, 288-9, 296-7, 307, 320 


464 Subject Index 


Swedish, 315, 348-57, 366, 415, 435 
Finnish, 435 
Syllable nucleus, 365, 367-83 


Tail, 112, 119, 127, 131, 182-4 

Tangents, 139, 142-8 

Tenderness, 286, 287, 292-3, 304-6, 311 

Tessitura, 440-42 

Terminals see Contour; Terminal; 
Phoneme, terminal 

Thai, 247, 261, 263-80 

Tonal collocation, 126 

Tonal reduplication, 128 

Tones, register and contour, 261, 
264-75 

Tonal languages, 13, 60, 250, 261, 263, 
313, 339, 367, 365-6, 385, 401 

Tone Sandhi, 60, 366, 392-400, 413 

Tone unit, 111-35 

Toneme, 13, 391-413 

Tonicity, 111, 113, 119 

Topic (theme), 96-7, 217, 222-9 

Toronto university, 40 

“Tune 1’ (in English), 359 

“Tune 2’ (in English), 360 

Unaccented (unstressed), syllables, 92, 
125, 137, 138, 148, 149-153, 251, 
254, 359, 360, 376, 383, 431, 451 


Vai, 451, 453 

Vaughan Williams, 282 

Verdi, 306, 308 

Vorack synthesizer, 353 

Vocal cords, 11, 19, 245, 290-91, 442 
Vocative, 170, 445, 448-9 

Vocoder, 238, 247, 350


Wagner, 304-6, 309-10 


Whisper, 247, 288, 365, 385-90 
Word order, 156-7, 216-30 


Yoruba, 13, 342, 451-5



Intonation, or the use of speech to

communicate meaning in language, has been called the
‘greasy’ part of speech. Only recently, with advances in
techniques of analysis, has much headway been made in
understanding the complexity of its form and function.
Intonation can now be seen, along with other ‘prosodic’
features of language, such as stress and rhythm, as relevant
to the theory of syntax, semantics and linguistic theory in
general, as well as to the study of pronunciation by linguists,
language teachers and others.


This book is the first collection of papers on this topic to
appear. It presents intonation from a number of different
points of view, examining the various theoretical approaches
which have been made, and then looking specifically at the
relationship between intonation and grammar, emotional
expression and music. Research into the acoustic analysis
of intonation is represented, and there are sections on the
comparative analysis of intonation, in different languages
and styles.


Dwight Bolinger is Professor of Romance Language and
Literature at Harvard University. His contributions to the
study of intonation, dating back over twenty years, have
been of unique importance in the development of the
subject.




